TTTGCAGAAC TTGCGCGCAC GTGaGTGGTC GTGAGAATGG
ATTCGGGTAT ATTTCCTTTG GAAGAcTTGA cAGCctGCGT 
GCAAAATACC GGACTTGCGC GCCCGCGGTG GGACGCAAGA 
AATGTTGGTT TGATGGTGTT CAGGTGGGTC CCCTTTCAAG 
CCCGCGCTGG TTGTGCGCGG TGCCTGAGAG GCCCGTTTGG 
CGTTTCGCTT TTTTGTGCCT CCTTGTTGCG TGCGGGCGTG 
CCACAGATCT TTTGAGGAGG ATTTTCATGG CCAAGGAAAA 
ACATGAACGT GGGTACTATT GGGCACGTCG ATCACGGGAA 
TCACCTCGTA CTGTGCAAAG AAGTTCGGTG ATAAGCAACT 
ATGCGCCCGA AGAGAAAGCG CGCGGGATCA CCATTAACAC 
CCGATCGTCG TCATTACGCG CATATTGATT GTCCTGGGCA 
TGATCACGGG TGcTGCGCAG ATGGACGGTG GTATTCTCGT 
ATGCCACAGA CGAAGgAGCA TCTTCTGCTC GCCCGTC AG g 
GTTTTTTTGA ACAAGGTTGA TTTGGTTGAT GATCCTGAGT 
GAGGTGCGTG ATGCGCTTGC TGGATATGGG TTTTCGCGTG 
TCTGCGTTTA AAGCTCTGCA GGATGGCGCT TCCCCGGAGG 
CTGCTTGCGG CCATGGATTC CTACTTTGAA GACCCAGTGC 
TTGCTCTCTA TCGAGGATGT GTACACTATT TCTGGGCGTG 
ATCGAATGTG GGGTAATTAG TCTGAATGAA GAGGTCGAGA 
AAGAAAACAG TGGTTACTGG CATTGAGATG TTTAATAAGT 
GGTGATAACG TGGGGCTGCT TTTGCGCGGG GTGGATAAAA 
GTGCTTTCTA AGCCCGGTTC TATTAAGCCA CACACCAAGT 
CTCTCTAAGG AAGAGGGTGG CCGTCACAGT CCTTTTTTTC 
TATTTTAGAA CTACTGACAT TACCGGTACG ATTTCTCTTC 
AAGCCGGGGG ATAACACCAA GATtATAGGT GAGCTCATCC 
GGTCTGAAGC TTGCGATTCG TGAArGGGGG CGCACTATTG 
TTTtGTTGTA GGCGTTTGCG GCGCGGAGTG TGTTTGGAGT 
GTTTTAGGCT GATGGAGGGG TTATGGCCAG GGAGAGAATT 
TGACGTGGAG CTAGTGGATC AAAGTTCGCG CGCGATCGTG 
GTACGCCGTG TACTATTTGA 8880

TTGTCTTATT ACCATCGGCC 8940 

GGATGCCAAA CCAGTCATCA 9000 

CGTGCGCAAT GCGCTTGGAT 9060 

TTGTAGCGCT GTCGATCCTT 9120 

CGGGGTCCGC CGTATCTGTG 9180 

GTTCGCGCGC ACTAAAGTTC 9240 

GACAACGCTC TCTGCGGCGA 9300 

AAAATACGAC GAGATTGACA 9360 

GCGTCATCTT GAGTATCAGT 9420 

CGCGGACTAT GTGAAGAATA 9480 

CGTGTCTGgC tGACGGCGTT 9540 

TTGGTGTTCC CTCCATCATT 9600 

TGCTAGAGCT GGTGGAAGAA 9660 

AGACGCCTAT CGTCAAGGGG 9720 

ATGCAGCTTG TATTGAGGAA 9780 

GTGACGACGC AAGACCTTTC 9840 

GTACCGTTGT CACGGGGCGC 9900 

TCGTCGGGAT TAAGCCCACT 9960 

TGCTTGATCA GGGAATTGCA 10020 

AAGAGGTTGA GCGCGGTCAG 10080 

TTGAGGCGCA GATCTACGTG 10140 

AAGGTTATCG TCCGCAGTTT 10200 

CTGAAGGGGT AGACATGGTG 10260 

ACCCGATAGC TATGGACAAG 10320 

CTTCTGGTCA GtGACAGAGA 10380 

TATTTTGCAA GGTGGGTGCG 10440 

CGGGTAAAAC TGTGCGGATT 10500 

CACGCGGTGC AGAAGGCGGG 10560 
CGCTGAGGTG CTCGGACCTA TTCCGCTTCC GACTAGGATG CACAAGTTTA CGGTCTTGCG 10620
CTCTCCTCAT GTGAACAAGA AGTCGAGGGA ACAGTTTGAG ATGCGTACGC ACAAGCGGCT 10680
GATTGATATC ATCGAACCTT CTCAGGAAGT GATGAATGCG CTTATGGGTT TAGAGCTTTC 10740
TGCAGGAGTG GATGTGCGGA TAAAGCAGTG AGGCGTGTGT GTTTTGTCTG TGCGTTGCGA 10800
TACGGAAGAG GTAGGTGATG GTTGGTTTAA TCGGCCAGAA AGTTGGTATG ACCCAGATTT 10860
TTGACGCaCG GGGTTGTGTT ACGCCGGTGA CGGTGATTCG GGTGGAGCAC AACGTGGTGG 10920
TAGGACTGAA GGATGTGGAG CGCTTCGGTT ACTCTGCaGT GATACTTGGC ACAGGGTGCA 10980
TGAAGAAAAG TCGTATCTCA AAGCCATATG CTGGACAGTT CGCTGAGCGG ATACCGCCGG 11040
TGAGGGTCAT GAGGGAGTTT CGGGGCTTTA CGTTGGACGT TTCGGTTGGG CAAGTGCTCG 11100
ATGTGCGTGT ATTGGAGTCC GTGCGTTATC TTGATGTGTG TGCTCTCTCA AAAGGAAAAG 11160
GATTTCAGGG AGTAGTGAAG CGGTGGGGTT TCAGCGGAGG TCGCTCTTCT CACGGATCGA 11220
AGTTTCATCG TGAAGCGGGT TCCACCGGGC AGTGTACGAG TCCTGGCCGT ACGTTTAAAA 11280
ACGTAAAAAT GCCGGGACGT ATGGGGGCTG AGCGGGTGAC GGTGCAGAAT CTGCGTATTG 11340
AACGGATTGA TGTGGGTTTG GGTGTCGTGA TGGTGCGCGG TGCGGTGCCA GGTAGAAACA 11400
AGGCCACGGT GTTTCTGCGG ACCGCGGTCA AGCGTGAAAG ATAGGGGTGT ATACGCAtGG 11460
AAAAGACAGT GTATTCGGTT GAAGGTGTTG CGCTGCGGTC AGTTGAGCTT GATGAGAGTG 11520
TCTTTGGGCT TTCGGTGAAC CGGGGTGTGA TTTATTACGC GATAAATAGT GAGTTGAGTA 11580
ACAAGCGCTT GGGGACTGCG TGTACTAAGG GACGTTCCGA AGTGCATGGT TCGAATACCA 11640
AGCCCTATAA GCAGAAGGGT ACGGGTCGTG CTCGCCGCGG AGATAAGAAG TCTCCACTTC 11700
TGGTGGGGGG TGGTACTATA TTTGGTCCTA AGCCGCGTGA TTTTCACTAT GCTCTCCCGA 11760
AGAAGGTGAA GCGTTTGGCC ATGAAGTCTC TCCTAAGTTT AAAGGCGCAG GGGGATGCGC 11820
TGACAGTGAT TGAGGACTTT ACGGTCGAAA GTGGAAAAAC TAGGGATCTG ATACAGGTGT 11880
TGCGTCATTT TGCACAAAGG GAGCGTACCG TTTTCATCTT GCAAAATGAT GATGCGTTGT 11940
TGAAGCGTGC GGGGAGAAAT ATTCCAACGC TCAGTTTTTT GTCGTACAAC CGTTTGCGCG 12000
CGCACGACCT TTTCTACGGG CGCAAGGTAT TGGTTTTGGA GACTGCGGTA CATAAGATCG 12060
CGGATTTCTA TCGGTCAAAG GATGCTGCAC AAGATGGAAC ATACTGATGT AGTGATTGCT 12120
CCX5GTGCTTA CGGAGAAGTC GAATGCGCTG CGGCAACAGG GTAAGTACGT GTTCCGTGTT 12180
GCAGCTCGTG CGACAAAGAT TCAGATTAAG CAGGCGGTGA CGCAGCTTTT TGGAGTAACG 12240
GTTAGGCGGT GTACGGTAAT GAATGTCTTT GGGAAGAAGA GGCGTGTTCG TCATCGGACC 12300



Printed from Mimosa 02/03/22 07:21:06 Page: 219 



WO 98/59034 

GGTAGGACGT CTGGGTGGAA 
GTTCTTGAGC GTGCATAGCG 
AAGgAGACGG GGATGGCGTT 
GTTGATCTGT GTCGTGCGGA 
AAGCCTGCCA AGGCGGGCAG 
GGGCATAAGC GGAGGTACCG 
ACGGTAAAGA CTATCGAGTA 
GCGAATGGTC AGAAGCGCTA 
GTTAGCGGAG AGAAGGTCCC 
GTTGGTTTTA CGGTGCATAA 
TCTGCAGGCA CCAGGGCGGT 
CCCTCTGGGG AGGCGCGTCT 
AATGAGGATC ATATGAACAC 
CGGCCGACAG TTCGTGGTAT 
GGGCGTGGTA AGGGACGTAA 
ACGCGCAAGA AGCGCAGGGT 
GGTATGTCTA GGTCGGTGAA 
GTCGAGATGA ACAAAGCGGC 
TGTTCCACCA TTATCCCTGA 
TGGATCCCAG TGTACATTAC 
ACTCGTGTGT TCCGTGGGCA 
TGACTGAGCG TGTCACGTAT 
GCGTCCGGTT GCGAATGTGG 
ACACTTACCG CACAAGGGTG 
TGCAATTGAT CGGGACAAGC 
AGATGAGGGG CCTCGTTTGA 
GTTGAAGCGG ATGTGTCACA 
GTCAAAAGGT TAGTCCAATC 
GGTATGCAGG TCCTCGGGAG 
GAAGgCGATC GTGCACGTTG
gTAAGCTGCG GTAGCTGCGT 
GAAGATGTAT AGGCCTATGA 
GCTTACCGCG CGCACGCCCG 
GGGTGCTGGG GGTAGGATTT 
TGATATCGAT TTTAAACGTG 
TGACCCGAAT CGAAGTGTGA 
TATACTCGCA CCCAAGGGTT 
TTTAGAGCCC gCGAACGCGC 
CGTTGAGCTT ACGATCGGTA 
GATTGCGGCA AAGGACGGTG 
GGTGCATCGC AGGTGCTATG 
GGCTTTGGGG AAGGCAGGTC 
GGCTATGAAT CCTGTGGATC 
CCCAGTAACT CCCTGGGGGC 
ATCCGATCGC TTTATCGTGT 
GAAAGGTCCC TTCGTTGATA 
TAATCAGAGA AATAAAAAGG 
AATGGTGGGC TTCACTATCT 
GGAGGAGTTT GTGGGGCATA 
TAGCGGTTCT GACAAGAAAG 
CGAGCGAAGA CAAAATTTTT 
TGAAGTGCAA GCCGTATGTg 
CACGTTTAAT CTCCaAGGTC 
GTCTTGATGA ArAGCGCTTG 
AGCGTCTGTG GTGCCgGGGA 
TCACTGTTGT GGTAGAGGAA 
GGTCTGAGAC TGGGGATCAA 
TACGCGGCGT TGCTGCATGA 



CAGCAGGACA GTCAATTGGT 12360

AAGtGCCAGA GCGGTGACCG 12420 

CGGCGGGCTT GCGGGGGCgT 12480 

AAAAGAGTCT TACACGCGGT 12540 

CGGTGCGTCA TCGTGGGGGT 12600 

ATTTGCACGA CATACCTGGC 12660 

ACATCGCGCT TGTGTTTTAC 12720 

TGAAGGTGGG ACAGCAGGTC 12780 

TGCCACTCGG GGTAATTCCA 12840 

AGGGTGGTCA GATCGCGCGT 12900 

GCTATGTGAT GCTTCGTTTG 12960 

CCACTATTGG TGAATTAGGT 13020 

GTGCGCGTTG GCGTGGGGTG 13080 

ACCCGTTAGg TGGTGGTGAA 13140 

AGCCGTGTCG AGGATACAAG 13200 

CAAAGAGAAA GTAAGGGGGG 13260 

AAAAGCTGTA TAAGCGAGTT 13320 

TGATCAAGTC GTATTCGCGT 13380 

CGGTGCACAA TGGCAAGTCG 13440 

AGCTGGGTGA ATTTTCTCCG 13500 

TGGGAAGGTA GGTGAACTGA 13560 

GGTTGCtCTC CGACAAAGGT 13620 

CGCGCGATGG CGCTTTTGGG 13680 

ATGAAGTCAG CGGCTTCGAA 13740 

TTCGTGCGTG ACATTCAGAT 13800 

CGGGsGCGGG GAGATGTTCA 13860 

AGTGTGAGGA CGAAAGATGG 13920 

TAAAGTATGG TCTTCTAGGT 13980 

GGATTTAAGG ATTCG TAGCA 14040 






TGATTCGCTC CTTTCCTGAG TGCAAAAATG CGGATATTGC 
ATCCCCAGCG AGTGACGGTA GTGATGCACA CCGCGCGCCC 
AGGGTGTAAA TATAGAAAAG ATTGGCGCTG AGGTTCAAAA 
AAATCAAGGT AAAAGAGATC AAGCGCATGG AGTTAAATGC 
TTGCTCGCCA ACTCACGGCG CGTGTTTCTT TTCGTAAGTG 
GGACGATGAA GTCTGGTGCT CAAGGGGTAA AAATTCGAGT 
CTGAGATGTC TCGCACTGAG GAGATAAAAG AGGGGCGTAC 
CGCAGATATT GATTATGGTT TTGCCGAGGC ACATACGACT 
GGTGTGGCTA TACTCAGGGA TGATGTACGG GAATGAGTGT 
GTTGCGGCGA TCGCGCAGGG AGAGTGGCCA AAAGTCTGAC 
TACGCATGCG GAGAGAGGTT GAGGTATGGC GCTTAGTCCC 
GTACAGCGGG GGAGGgTGAA GGGGGATGCC ACTCGGTGCA 
TACGCGCTGG TGTGTCTTGA GCCGTTTTGG TTGACGAGCC 
GTAGCGTTAA ACCGAAGgAT TAAGCGCGGG GGTAAGTTGT 
AAGCCATACA GCAAGAAGCC TGC AGAGACG CGTATGGGAA 
TATTGGGTTG CGGTAGTAAA GCCAGGTACT GTTCTGTTTG 
GCG TTGGCAG AGCAGGCGAT GCTTCTGGCA GGAAGTAAAC 
GCCGAACGCG TACAGGAAAT TTAGAGGGGA GCTGTAAGAT 
AATTATCATA TTCTGAGCTT CTTTCGAGGC GTCGTGAGCT 
TGCGcTTTCA GCTTGTTGTT GAGCATGTTG ACAACAAGCT 
GTCAAATTGC GGCGGTTAAT ACTTTTTTGC GACATAAAGA 
GAGGGGTTCG GGAGTGATGG AGCAG TGT AC GGTGAAAAGG 
CGGGCTGGTG AC CAGTGAC A AGATGCACAA AACCGTTACG 
GTTGCACGCG TTGTATAAGA AGTACGTGTC GCGGAgcAAA 
GGAAAATACC GCGCGGGCAG GGGATGTGGT GCGTATTGCC 
GCGTAAgCGC TGGCGGTTGG TAGAGATTGT TGAACGAGCG 
GATTCAGGTG CAGTCGCGGT TGAACGTCGC GGATAATTCT 
TATTAAGGTG GTGGGTGGAT CCCGTCGCCG GTACGCGAGT 
GGCAGTGAAG GATGCACTTC CCACTTCTGT GATTAAGAAA 
CGAGGTGGAG ATTGTCCGTC 14100

TGGAGTAGTT ATTGGAGCAA 14160 

GCGTTTGAAT AAGAAGGTTC 14220 

TTACTTGGTT GCGCAGAATG 14280 

TTTGCGGCAG GCCTGTGCGG 14340 

TTCGGGGCGT TTGGGTGGTG 14400 

GCCTCTGnCA nCACgcTGCG 14460 

TATGGGAGTA TCGGGGTAAA 14520 

CGCAAAGATG TAGGcTCTCT 14580 

GAGTTGGTGC GCGACGAGCG 14640

AAGCGGGTAA AtACCGAAAG 14700 

ATGCGGTTGA TTTTGGTGCG 14760 

GACAAATCGA AGCGGCTCGT 14820 

GGATTCGTGT TTTTCCCGAT 14880 

AAGGAAAGGG GTCGCCTGAG 14940 

AACTAATGGG TGTAGAACGA 15000 

TTCCAATCAA GACGCGGTTT 15060 

GGGTCGGGGT GGGTGTGCGC 15120 

TGAGAGAAAA TACTTGGATC 15180 

TATGAAAAGG ATTCTCCGTC 15240 

GTTGACTGAA CTAGAAAAGA 15300 

CCTGAGCGGC GCACCCTTGT 15360 

GTTCGGATTA CGACAAAGAA 15420 

AAGTATCAGG CTCATGATGA 15480 

GAGAGTCGTC CTTTGAGTAG 15540 

AAGTAAGGGA TTTGTGTCAT 15600 

GGAGCCAGGT TGGTGCAGTG 15660 

GTTGGGGATA TCATCGTGGT 15720 

GGATCAGTAG AGAAGGCCGT 15780 
CATTGTACGA
CAATGCCTGT 
TGTTGCGCGG 
TTTGTGAAGG 
GTGATTGCCG 
G ATCGCG TTT 
CAGGATGAGG 
ATGGGCAAGA 
GTATGTCGTA 
TCCGGTATGT 
TGCAGGTTCC 
ATCGGAAGCT 
TAAAGACTAG 
GGGTGATGGT 
TTGCTCTGCC 
GTAATTACTC 
TCGAGCGAAT 
CTCGTACTCT 
GACAGTAGCA 
ACCGCTGTGG 
TGTGTTTTAG 
AGGAAAGGAG 
GTAACGCGGC 
GGTTGTGAAA 
TGGTTCCGGT 
CGGTATCGAG 
TCGTGTGTAT 
CaGGCATGC a 
TGTAGTGTCA 



GTTTCTAAGG 
GTTGTTATCG 
GAGCTGCGGG 
GGAAAGTGAT 
GCAAAGATCG 
TGGTGCAGGG 
GGGGTATCAT 
AGGGGCCTAC 
AAACAGGAGA 
GCAGCAGATT 
TAAGCTGTTG 
TTTGGACGCG 
GGCGCGCAAG 
GACTC TGCGC 
TCGTGTAAAG 
GATGGGTATT 
TAGCGGTTTG 
TCTTACGAAG 
ATGATCAATA 



AAAGCTGGCG 
GGAGAGTATG 
GCtGcGGGAC 
ATACTGAAAA 
TGTATTCGTG 
CGGATTTCTA 
AACGGGTACG 
AGGGAGCAGC 
AGAATTGGTA 



AATATCGTCG 
ATGCTAATGG 
ATATGGATTT 
GGGGAAGACG 
GGGTAAGCGG 
TTTGAACATG 
GGAGGTTGAA 
GCGCGTGGGG 
GGTGCTATGA 
GTTCCGGATA 
AAGATAGTGT 
TCAGTAGCAG 
AGTATTGCGA 
CGTAGTAGGA 
GATTTTCGTG 
ACGGAACAGA 
AACGTCAATG 
CTCGGTATGC 
AGGCAAAAGC 
CGACCCCGCG 
AGCGAGGGTC 



ATGAAAAAGT 
CGGAAGGATA 
TGTTTCTTAA 
CTCCGGGCCG 
GCACTTTGAT 
GTGTGGGTGG 
AAGTTCCTGT 
CGTAGACGGT ACTTATATTC GATTTGACGA
AAATCCTAAG GGGAAGCGTA TTTTTGGTCC 
TACGAAAATC GTGTCTTTAG CTCCTGAGGT 
GTAAAGATTC GCAAGGATGA CATGGTATTG 
GGTGCAGTGC TGCGTGTGCT CCGCGACGTA 
CGCAAAAAGA CGATTCGTAG AAAGAGTGCT 
GCTCCTATTC ATATTTCCAA CGTTATGATT 
TATCGGATGG AAAACGGTAA GAAAGTGAGG 
CCGATCATTC TTGCATACCT GAACTCAAAG 
TGATGCGGGA TTTTGGTTAC TCGACGGTGA 
TGAGTATGGG TCTCGGGGAA GCGCTCGCTA 
ATTTGGGTGT TATTAGTGGC CAGCATGCAG 
ATTTTAAGCT GCGTGAAGGC AATGAGATTG 
TGTATGAGTT TCTCCACCGG CTCATCAATG 
GGGTAAGTCC TCGTGGGTTT GATGGACATG 
TTATTTTTCC TGAAATTGAC TTTGACAAAA 
TAGTGACATC TGCGCAGACA GATCAGGAGG 
CTTTTAGAAA ATAAGAGAGG ATTTCATGGC 
AACTCCGAAA nTACgctACG CGCAGGTACA 
GGTACATGAG GAGATTTCAA TTGTGCCGCC 
AAATCCCTGG GGTAACGAAG TCGAGTTGGT 
AGACATGCTC ACGAAGATAC 
GGATGTAroCT TCTTCGAAgT TGAAAGTTGA 
TATCAGGAAC TTCAGGAAAG TAGAGGAGGA 
GTATGACGAT AACGAAACGT CGGTTATTCA 
CCGTGTGTAC TCGGGGTACA AGACG CTTCG 
TGTTTCTACC TCTCTAGGGG TGACCACTGG 
TGAGCTGATT TGCAAAGTTT GGTaGGGGGC 
GTCTGTTCCT GGCGGTGTGC ACGTGCGAGT 
15840
15900 
15960 
16020 
16080 
16140 
16200 
16260 
16320 
16380 
16440 
16500 
16560 
16620 
16680 
16740 
16800 
16860 
16920 
16980 
17040 
17100 
17160 
17220 
17280 
17340 
17400 
17460 
17520 
CTCTTCTGGG GTGGTTGAGG TCGAGGGTCC AAAGGGGGTG CTTTCGTGTG CGTTTCTCCC 17580

AGTGGTTACG GTTCGTGTTG AGCAGGAATA CGTAATTGTT GCCCGGTGTG ATGATTCCAA 17640

GCGCGCGCGT GCATGTCATG GGCTGTATCG CAAGCTTTTG AGCAATATGG TAGTTGGGGT 17700 

AAGCGAAGGg TTTTCTAAGA CATTGGTAAT TACGGGTATC GGGTACCGCG CTGAGGTTCA 17760

AGGGCGGGTG CTGGTGATGG CATTGGGTTA CTCCAATGAC TTTACAGTGC TCATTCCCTC 17820 

TGGTATTGAG GTGCGGGTTG AGTCTTCCAC GAGGGTTATT GTTTCCGGTG TAAGTAAGGA 17880

AAGAGTGGGG GAGTTCGCAG CGCAACTTCG TAGGCTGCGG TTGCCTGAGG CGTATAAGGG 17940 

TAAGGGTATT CGC TATGATT ACGAGACCAT TGTGCGTAAG GTAGGAAAGT CAGGGGTAAA 18000 

GTAGAGGTAC GCATGCTAAG GAAGTGCAGT GATAAACAGC GAAAGAGGAT GAAGCGTAAG 18060 

GTTCATATTA GGAAGAGGGT GTATGGCACG gCGGTTCGCC CTCGGATGAC GGTGTTCCGA 18120 

AGTAATCGGA ACATTTCGGT GCAGGTCATT GACGACGACG CGCGTAgCAC GCTTGCGTCA 18180 

GTTTCTACTC TTGAGAAGGA TTTTGTTCTG CTTAGGGCAA ATGTTTCTTC TGGTTTGCAG 18240 

ATAGGAGAAG AGATCGGCAG GCGCCTTTTA GAGAAACACA TTGACACGGT TATCTTTGAC 18300 

CGAAATGGGT ACTTGTACCA CGGGGTAGTG GCGGCCGTCG CAGATGGTGC ACGTAAGGCA 18360

GGAGTTAAGT TCTAGGAGAG CGTATGGATC GTCACAGGGA TTTTGGCAAA GACAGACTTC 18420 

GAGACAAAGA GTTTACCGAG AAATTAATCA AGCTGAACCG CACGGCAAAG GTAGTAAAGG 18480 

GCGGACGTCG GTTTTCCTTT TCGGCACTCA CGGTAGTTGG TGATCAAAAG GGCCGCGTGG 18540 

GGTTTGGTTT TGGTAAAGCC GGGGATGTGA GCGAGGCAAT TAGGAAGAGT GTTGAAAGGG 18600 

CGAAGCGGAG TATGGTGCTC TTTCCGCTCA AGGATGGTAC CATCCCGCAT GAAGTACAAG 18660 

CTAAGTTTAA GGGCTCTCTG GTGTTACTGC GCCCTGCCTG TTCAGgTACG GGTATTATTG 18720 

CTGGTGGAAC CGTGCGTGCT ATCATGGAGG TTGCAGGTGC AACCGATGTG CTGTCTAAgT 18780 

CTTTGGGTTC GAATtCTGCT ATCAACgTGG TTCGTGCaAC gTTTGGGGCG GTTGCscAgT 18840 

TGATGGATGC aAGAAAGTTG GCACgTGAGC GTGGGAAGGC ACTCGTGGAT ATGTGGGGGT 18900 

AGGCATGACA AAGAGGGTGC GTATAACGCT GGTGAGGAGT ACGATCGGTC AGAGGGAGCC 18960 

GGTGCGTCGG ACGGTTCGGT CTTTGGGTTT GAGGAAGTTG CATTCAATGG TGGAGAAAGA 19020 

CGGGAGTCCT GCCGTCTTGG GGATGGTGCG AGCTGTTTCG CACCTGGTGC GGGTGGAGGA 19080 

GTTAGGTTAG TGGCGGATTT CCATTTGATT GCTCCGAAGG GGgCTAATAG GGCGCGTCGT 19140 

ATCGTGGGTC GTGGGTCCTC CTCTGGGCGG GGTACCACGT CTGGGCGGGG TACTAAGGGA 19200 

CAGCAGGCCC GTGCGGGGCA TAAGGC TTAT GTAGGTTTTG AGGGTGGGCA GATGCCGCTA 19260 
TATCGGCGTG TGCCGCGGCG
GTTAATGTGG GCGCGCTTGA 
CTCATTGAGA AGGGCTTGGT 
GAGCTGACAA AGTCTATTGT 
ATTCAGCAGG CGGGCGGTTC 
GAAACAGGGT GTTTTTGCAG 
CACGCTTAGC GTGTTGACGG 
CCCGCGTGCG CTTTCTGCTT 
CATGGATTTT TTTGTAGGCG 
GCCGTACATT TCGACGCAGA 
GAAGGTTGTA GAAGATGTAG 
GGTTTTTGTG TGTCTTATAC 
TGCCATTGTT ATTCAGAGCT 
AGGGAGTATG ATCACGCTTT 
TGTGTCAATG ATTATTTTTT 
GTGGAGGCTG CAGCGTCTTG 
GTTTGTAGGA ATTATTGTGC 
tCATTATGCG CGGCGTGTGG 
TTTTAAAATA AAmCcTTCGG 
TCCCCTGCAG ATAGCCAGCA 
GTTCTTACGA CCGAACAGTT 
TGCGTACTTC TACACGCAAG 
GAACGGAGGT ACGATTCCGG 
CTTGAACCGC CTGGTACTTC 
CTTGATTCAA GCTGCGTTTG 
TCTGTTGATT CTGGTAGGGG 
AATGCGGCAG CGTGAGGGGT 
GTACTtGCGA AAGGATG TAT 
TTATTTGTGA TAAGTGTAAG 




GGGTTTTTCT AACTGTGCTT 
GTTTGTCTAT GCTCCAGGGG 
AAAGGGGCGG GTCCCCTTCA 
GGTGCGGGTG GACCGGGTTT 
AGTGGAGTGT ATTGAAGCGC 
CGGTGTTCCG GATAAGGGAg 
TGTTTCGCTT TGGCTCGGTG 
ATTTCCGATC TCAGGTTCGG 
GGGCGTTCTC GAATTTTTCA 
TTCTCATGCA GCTTTCGATG 
GGGGGAGACG TCGCGTTCAG 
AGTCTTCTGC GGTAACCGTT 
ACGCCGTGCA TCTGTTTGTC 
GGCTTGGGGA ACAGATCACA 
CGGGTATTGT CGCGCGTTTG 
GCGAATTGAA TATGGTGTTT 
TGGTGGTGTA TGAGCAGCAG 
TCGGGCGGAA AATGTACGGT 
GCGTAATTCC G ATTAT TTTT 
GTATTGGACC GAACGTGCGC 
GGTGGTACAA CGCGTTCTAT 
TCACCCTTAA CCCGACTGAG 
GTATTCGTGC GGATAAGACG 
CCGGTTCGTT GTATCTTGGG 
GGTTTCCGTC CTCTATTTCC 
TGGATCTAGA CACTATGAGT 
TGGGAGGGCG TGGCAAAGTG 
ACGAGGAGTG AGTTCATGAA 
CTTATTAAGC GTTTCGGTAT 
TCAAAAAGGA ATACGCGGTA 19320

AGACGGTCAA CAGACAGACT 19380 

TCAAAATCTT GGCAGACGGA 19440 

CTGCTCGTGC ACAGGAGAAG 19500 

AGGAACGATG AGCGGTATAT 19560 

CTGCGTGCGC GTATCTTTTT 19620 

CTGACAGTTC CGAGTGTGGA 19680 

GGAAATGCTT TTGCAGACTA 19740 

GTGTTTATGC TGGGAGTGAT 19800 

ATTGTTTTTC CAAGTCTTAA 19860 

TTTTGGACAC GTGTTGCAAC 19920 

TACGCAAATC AGATTCCCGG 19980 

ACCATGCTGA CGGTGACCTC 20040 

GCGCGAGGCA TTGGTAACGG 20100 

CCTCATGCGC TTGCAGAGAT 20160

GTGATCGTTG CGTTTGTGAT 20220 

GGGCAACGAA AAATACCAAT 20280 

GGTCAGAGCA CGTATATCCC 20340 

GCCTCATCTT TTTTGACATT 20400

TTTCTGCATC AGcTTGCGCA 20460 

GTAGTTTTGA TTGTGTTTTT 20520 

ATAGCAAAGC AGATTCGCGA 20580 

GAAGAATATC TACAAGGGAT 20640 

ATGATCGCAG TGCTGCCCAC 20700 

TTACTGATGG GCGGTACTTC 20760 

CAGATTGAGG CGCAGTTGAA 20820 

CTACCGCGCA TTTGTAGCGG 20880 

GATAAGGACG AGCGTaAAGG 20940 

TATCCGGGTG ATTTGTGTGA 21000 
ATCCAAAGCA CAAGCAACGT CAGGGCTAAG GGGGTAGACG AGGTATGGCG CGTATTGCGG 21060
GGGTTGATCT TCCTAATAAG CATGTCAGCG TTGCGTTAAC TTACATATAT GGTATTTCGC 21120
GTTCATCCGC CAGGACTATT TGTGAGAAGG CCCGCATCAG TTCTGCTTGT CTGATAAACG 21180
ATTTGAGTCA AGATGAGCTT GCAGTTGTCC GTGCAATTAT CGATAGAGAA TACAAAGTGG 21240
AAGGTCGTCT GAGAACTGAG GTTGCCTTAA ATATCAAGAG GTTGATGGAT ATTGGGTGTT 21300
ACCGAGGGCT AAGACATAGA AAGGGGCTGC CTGTTCGTGG GCAGCGCACG CGAACAAATG 21360
CGCGCACACG CAAGGGTAAG AGAAAAACCG TCGCTGGAAA GAAAAAGTAA GGGATCAGGA 21420
GGGCATTGTG GCGGTCACAA AGAAGCGTAA AGAAAAAAAG AATGTGTACG AGGGGAACGT 21480
GTATATCCAG GCGACTTTCA ATAACACCAT CATAACGGTT ACTGACCTGC AAGnAAATGC 21540
GCTCTCCTGG GCTTCGTCCG GGGGCCTTGG GTTTAATGGG GCAAAGAAAT CTACTCCTTT 21600
TGCAGCACAG ACGGTCGCGG AAGCTGCGGT ACAGAAAGCG CAcAGTGCgG acTGCgTGAA 21660
GTACATGTGT TTGTCAAAGG GCCGGGTATT GGGCGTGAGT CAGCAATTAG AATGCTTGGT 21720
ACCATGGGAC TGAGGGTGCG TTCGATTCGC GACATCACAC CCATTCCACA TAACGGCTGT 21780
CGTCCGCGTA AAACTCGCCG CATCTGATAA AAGGAGTGAG CATGCCTCGT AGAAATCTTT 21840
TGAAGGGTTT TAAAAGACCT AAGGTGCTGG AGTTTCTTTC GGAGAACTCA AGCGAGTGTT 21900
ATGGGAAGTT CACCGCCTCT CCTTTTGAGA CTGGTTTTGG CACCACTGTT GGTAACTGTT 21960
TGCGGCGCGT CTTACTCTCT TCTATCCAGG GGTATGCGGT CACCGGGGTT CGCATCACGT 22020
CCTTTGATGC GGACGGGGTT GCGCACTTCA TTTCAAGCGA GTTTGAACAG ATTCCCCACG 22080
TACGGGAAGA TACCCTCGAG ATTCTAAATA ATTTTAAGCG TCTGCGTTTT CTCCTGCCGC 22140
AGGGGcAGAG TCTAGTACGT TCACGTATGA GTTTCGCGGC GCGgTGTCTT TGACGGGGAA 22200
GGACTTTGCT AAGAAGTTTC AACTCGAGGT TCTGTCTCAA GACCTGCTCA TCATGGAAAT 22260
GATGGACGGT GCGCATGTTG AAGTAGAGCT ACACGTCGAA TTCGGGCGTG GGTATGTACC 22320
TGCTGAATCG CACGATCGGT ATGCCGATTT AGTTGGGGTT ATCCCTGTTG ACGCAATTTT 22380
TAGTCCCGTG TTGAGAGTCC GCTATGATAT TCAGTCTTGC CGTGTAGGTC AGCGGGGGGA 22440
TTACGATCAG TTATCCCTTG AAGTGTGGAC AGATGGTACG GTGCGTCCCG AAGACGCGAT 22500
AgcCGAGGCA GCGAAAATTA TCAAGGAGCA CTTTACAGTT TTTGTTAATT TTGACGAGAC 22560
CGCGCTCGAC CTGGAGGACG AGCCAGAAGA GGATGACCCT GCCGTTCTGG AGCTGTTGAA 22620
CACGAAAATC GCTGATGTAG ATTTTTCAGT GCGCGCGCGT AACTGCCTTT TAACTATGGG 22680
AATCAAGACG CTGGGGGAGT TGACAAGGAT TTCTGAGCAG ACACTTGCGA ATACGCGTAA 22740
TGTGGGTAAG AAAAGTTTAA GTGAGATACA
GGGTATGGCT GACTACAACC ATGTGGGGGT 
AATAGATGAG GCATAGGACC GGTTTCAACC 
CGCTCCGTCG CAATATGGTT ACTTCTCTTT 
CGAAAGCTGC CGAGGTGCGG CGCGCGGCAG 
CTGTGCATAA CCGGCGCCAG GTGGCCCGTT 
TATTTGCGGA TATCGGACCT CGCATGCGGG 
AGTTGGGCCT CAGGCAGGGG GATGCGGCAC 
CCTTTGAAAA AAGCCTCAAA AAACGCGCGC 
CTGGGAAGAA GGaTGcTTCG CGCGTCAGTG 
TAGGAAAGAA GAAAGAATAG CAGTTGGGCA 
TGGAAAGGGG ATCCGGGGTA TGGTCGGTCG 
GACGGGGGTA AAGCTCCTGT ATGAGTGCGA 
GGTTGGGCGC GCGACTCTCC AGAATAGGAA 
ATCGCGCATC CTCGTGATAT GAGGTTCCGT 
GTGCTTGGCA GTTACCATTG GGATGCAGGT 
ATAGGTGTTT TCTTGACCGA GGGCGGCGTC 
GTGTTATGGC AAAAAAGGAG AAGAAAGTGT 
CCTCAGGTTG TGACGAGGCC TTGGAGCGGG 
CGGTTGAATC GGGGGAGGGT TCTGTTCCTG 
CCTCTGAAGA GACCCTGCGC GArCGCGTGA 
CTGCCGACCT CGAAAACTAC CGGAAGCGTG 
AnCGTACGCG GCGCTGCTTG CCGACATCGT 
TGAAGCGGCG GATCACGCGT CGAGTACAGA 
TGTTCTTATG ATCCGCAAGC AGCTCTCCTC 
TTACCCGGTG CTCGGGGAGC GCTTCGATCC 
TTCCGCTTCT GTGCATGAGA AGATAGTAGG 
GAACCGTATC CTCCGGCATG CCAAGGTTAT 
CGATCGTGGG GATGGcCCTT CGGAGTGACA 
GGgCAAGTTG CAGGAATATA ACTTGCGTCT 22800

TGTTAGTAGA CTGATGCGAC AGAAGGAAGA 22860 

CG c TTTCGTG tATGGCTGCG CATAGGCGTG 22920 

TTAAGTTTGA GCGGATCACC ACGACGAAGC 22980 

AGAGGTTAAT TACGCGTTCT AAGTCTGACT 23040 

TTATTTGGGA TAAGGCTGTG TTGCACAAGT 23100 

AACGTGAGGG GGGGTATACG CGCATATTGA 23160 

ATGTGGTTGT GTTGGAATTG GTTGACTATA 23220 

GTACTGATAG TGTGCCTGCA AGAAAAGGAG 23280 

GGACGGTTCC AGACGGTCAG TCTCAAAAAA 23340 

ATGGAGGGGT GGTATGTCGA AGGCTCATCG 23400 

TGGCCGTGGC GTGTGTCCGG TGACTGGGCA 23460 

GATTGATGGT AAGAAGGTCA AGGTTTCCAA 23520 

GAGACGTTTG GATGCGCAgC CTGGAGCTTG 23580 

CCCAAGGACG TTAGGTGGTT GTCCGTTTCT 23640

CGCATCGTGG TCGGTGTAGT CAGACGGTAA 23700 

TCTCGTTACT TTTACGGCAT TACCGCGAgG 23760

GCGGCGGCGA CGTTCAGGGG CAGGGAGTTG 23820 

CAGATAGCCT TCGCGCGTCT GATC CTGT AC 23880 

GGGAGCATAG TCaGGAGTTG GAGACAGGTG 23940 

ATGTTTTGCA GGAs CAGTAC CtGCGCAAGG 24000 

CGTTGCGGGA AAGGCAGGAG gCGGTGGAAC 24060 

CGCTGTCTTG GATGACTTTG ACCGTGCTAT 24120 

GGTGGAGGCT TCATCTGCCT TCCGAGAGGG 24180 

AGTGCTTGAG ACAAAGTATG GTCTTGAGTA 24240 

AAATCTCCAT GAGGCTTTGA GTATGAGTCC 24300 

GGCAGAGCTA CAAAAAGGAT ATAGGGTTAG 24360 

GGTGCTCACT CCTGAAGAGC AG AC AGAGC C 24420 

GGCAGGGTAT GCTGAGAGGT CAGGATGGAG 24480 
TTCTGGAGCA CCGGTGCTAG
CCTATACAGA GGAGTTGAGG 
ATTCATGTGT TGCGATCATG 
GAAGGACTAC GCCCTCCATT 
CAGCAAAAAA CCAAATGGTT 
TCGGCAGTCG TTTCAATGAA 
CACAGGGAGA CGACGTGCGC 
CCGCGTTCAT TTTGCAAAAA 
CAGAGGCAGT CATTACCGTT 
ATGCGGGGAA GATAGCAGGG 
CGCTTGCCTT TGGTTTTAAC 
TTGGGGGGGG T AC CTTTGAC 
AGTCAACGAA TGGGGACACT 
GGCTGGAGCA GGGCTTCAAG 
TGCAGCGGCT GAGAGAAGCG 
CCGAGATTAA TTTGCCCTTC 
CTCTCTCTCG ATCTGAGTTT 
CTTGCCGCAA GGnGCTCAAA 
TAGTTGGTGG TTCCACGCGC 
AAGAAGGATC GAAGGGAGTC 
GAGGTATCCT CGGGGGGGAC 
TAGGAATTGA AACAATGGGC 
CCACGCGCAA GAGTCAGGTG 
ACGTGCTGC a GGGGGAGCGT 
TAGTAGGAAT TCCCCCTGCT 
ATGCGAATGG TATCGTGCAC 
TCCGCATTGA AAGTTCGAGT 
CCGAAGCGAA TGCAGAAAGT 
CTGACTCCCT AATCTATCaG 







GTAACGGCTA TACTGCGCgC 
GTTATGGGGA AGATTATTGG 
GAGGGGGGGG AGCCCGTTGT 
AyCGGTTTCA CCTCTGATGG 
ACTAATCCGG AACATACTAT 
CTGACCGGTG AAGCAAAAAA 
GTTGAGGTGG AGGGTAAGCT 
ATGAAGAAGA CAGCTGAGGA 
CCGGCTTACT TTAACGATGC 
CTCGATGTGA AGCGTATTAT 
AAAGACTCTA AGAGAGAGAA 
ATATCCATCT TGGAACTCGG 
CACCTGGGGG GCGATGACTT 
AGTGACACGG GTATCGACTT 
GCGGAGAAAG C AAAGAT AGC 
ATTACTGCAG ATGCCAATGG 
GAGAAGATGA CTGATGATCT 
GACGCCGGAA TTAGTGCGGA 
ATGCCCAAAG TAGCGCACGT 
AATCCTGACG AGGCTGTCGC 
GTGAAGGATG T AC TTCTC TT 
GGGGTGTTCA CTCCGCTTAT 
TTTTCCACCG CAGCTGATGG 
GGCATGGCGA ACCAAAACCG 
CCGCGGGGAG TGCCGCAAAT 
GTTTCTGCCA AAGACCTAGG 
GGTCTGAGCG AAAGTGAAAT 
GATAAGCGTG AGCgGGAGAA 
ACGGAAAAGA CGCTCAAGGA 
CCTGCAGGCA GGGCGGGTAT 24540

CATTGACTTG GGAACGACAA 24600 

CATTCAAAAT GCCGAAGGGG 24660 

TGGACGCGTC GTCGGTCAGC 24720 

CTATTCGATA AAGCGCTTTA 24780 

GGTGCCCTAC AAAATTGTTC 24840 

TTACTCTACG CAGGAGATCT 24900 

TTATTTGGGC GAGGCAGTCA 24960 

ACAGCGTCAG GCAACCAAGG 25020 

TAATGAGCCG ACTGCTGCGT 25080 

GATTATTGCT GTGTATGATC 25140 

TGACGGTGTT TTTGAAGTCA 25200 

TGATGCACGT ATCGTGCAAT 25260 

GGGCAACGAC CGCATGGCGT 25320 

GCTTTCTTCC TCTGCGAGTA 25380 

GcCAAAGCAT CTCCAGAGGA 25440 

TTTTGAGCGG ACCAAAGAGC 25500 

CAGGATCGAT GAGATTCTCT 25560 

GATCAAAGAT GTCTTTGGGA 25620 

AATTGGCGCT GCAATTCAAG 25680 

AGACGTTACG CCTCTTTCTC 25740 

CAGTCGTAAT ACCACCATCC 25800 

GCAGACGGCA GTTTCCATTC 25860 

GACGCTCGGT AATTTTGATC 25920 

TGAAGTGACG TTTGACATTG 25980 

GACGGGAAAA GAGCAGCACA 26040 

CGACCGCATG GTAAAGGAAG 26100 

AATc GAAGC A CGTAACGTGG 26160 

GGCGGGAGAC GGGGTGAACG 26220 




C^CCG CGCGCGCATA (SACGAGGCGA TCGCAGAGTT GAAGACGGTG C^TCcAGGc 
GACGACGTCG CATCGATCAA AGOGAAGACT GAGA^ ^omC CTACAAAArT 
^GGAAA TGTATAAACG ******* OCOQ^OCCQ C^CACGTAA GAAGAGTGAT 
C^CCCTC^ C^AATGAGGC AGAAGGTGGT GACG^ATT ACGAGGTAG. GAAGGACGAA 

GATTCAAAGT aggca^ TCTTC cgggg agggaatagc c^tag GAGC TCT G TC 

ATCTGACTTC CCCCAGGCCT TTTGTGATCC GGGTGTTCGC CTGA^CC GGGTCTTTCG 
qCTGTCTAGT GGGTGTTTGG ATGTAGCCTG CGTAGGCGGT GCTTCAGGCG TCCTGCTTTT 
GTGCCGGTTT CGCG^CACA CTGTGTGTGC GCGCAAATGT AGACAAAGAT 

TCTCTAGACG GGGTGATCGT GGCAAAGAAG GATTATTACG AGG^^ TATCTCAAAG 
ACCGCGAGTG GAGAAGAAAT CAAAAAGGCG TACCGGCGGC TGGCTATTCA GTTTCATCCT 
GACCGTAATC AGGGAAATAA AGAGGCGGAG GAACGCTTCA AGGAGGCTAC CGAAGCCTAT 

o^aarc, — tgcaca gaagcg^c gcgtacgatc ggta^gc- — - 

A.GGATATGC ACGGTGCGCA ^ TC^GGCcT TTCAGGGGTT CGAAGATATT 
TTTGGGGGTG GCTTTTCTGA TATCTTTGAA AATATTTTTG GGAcTTCGTC TCGCCGCGGC 
^AGGGA ACGACGGCTC GGGTGGCTCC GGGCGTGGGG CAAACTTGCG TTATGATTTG 
CAAATCTCTT TTGAAGAAGC AGTGTACGGG AAAAAGAGTG AGCTGCACTA TGTGCGCGAC 
GAAACGTGTA TTACCTGCAA GG tGCCGGCT CGGCCAGCCG TGGGCGTAAG ATGTGTCCAG 
ATTGCAAGGG TACGGGGCAG ATTCGGCGTA GTACAGGTTT TTTCTCTATT GCGCAAAGTT 
GTGCGCGCTG TGGTGGTGAG GGGACGATTA TCGAAAGTCC CTGTGCACGG TGTGCGGGTA 
GTGGCATTGA GCGTAAAAAG CAAAAAATTA TCGTCAGTAT TCCGGCAGgT GTAGAAGAAG 
GGCGGCGCAT TACTATTCCC CGTCAGGnAA ACGCCGGTCG CGCAGGCGGT GCCTACGGGG 
ACCTGTACGT GTTTGTGTTT GTTCGTGCGC ATGAGTATTT CGAACGTGAA GGTGCTGACC 
TGTACTGTGC AACTTCGATA TCGGTAACCC AAGCGATTTT GGGCGCGCAG GTGACGGTGC 
GGGCATTAGA ^ATC^CG CACAGn^CG GGTTCCGGCC GGCACGCAGG GAGGTGCGCT 
^G^ AAGGGTATGG GGGTCCCACT GGCGCGCGGG GCGGGGGATT TGTACGTAAA 
GGTATTGGTG CGTATTCCAA CTACGCTTTC TGCACGGTCG CG^CGCTCT TAGCGGAGAT 
TTCTCAAGAG GAAGGGGAAA ACGCCCA TC C GCCGTTGCTT GAACTTTCAA GTCTCAAGTA 
GGCTACAGAA AGGGGCGCGT GGGGTAAAAG GATTATTCTC GCGTGCG^G TGTTTCTTTC 
TCGTGTGTCG CAGGATGAGT TGGcTTCATC GTGAtGGGTG CGTGTGCTAT CTGAGTTTTC 
26280
26340 
26400 
26460 
26520 
26580 
26640 
26700 
26760 
26820 
26880 
26940 
27000 
27060 
27120 
27180 
27240 
27300 
27360 
27420 
27480 
27540 
27600 
27660 
27720 
27780 
27840 
27900 
27960 
TTCCCACAGT TTAAAAGACA ACGTGTTTTT GAAGCAGCCA TACAAAGGGA ACGGTAGGTG
ATTCGCAGAA GGCTCGCAAT TGTAAAGGCA GGTTCATTCG CACTCCTGGC GCTTTTTTTT 
TCAATATTTT TGCGCTTTCT CAGTCCGCGG TATTCGTTTC TCGGTCGTTT CGTTTCTGCG 
CGCGATATGG CGCTGTTGAT TTCTCGGTAT GAGyATTTGC CTGAGCTTTC TTCGCGTGAT 
CGAGCCTTGC TGGTAGGTTT CGTTTTCATG ATTTTTnGGT TGCGCTTACA GAAATCCAAC 
GCTATGCGCA CGGGCGCATC CCGTCTTGTT GTCTA 
(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 5199 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 
AACTTTGTGG TGATTAAGGG GTTGGAGCGA TATCAACGCT GGGATCTTGC GCGGGAGTGT 
TCTATCCGTC ATCTCTATTA TGTGTTGGAT GcTTTGCAAT TGAACGATCA AACAAAGCGT 
GGGGTTCTGT GGGAAGCGTA TCTGCCTACG CGTGAAGGTC CTGCACAATG GCCAGGGAAA 
GAAGGATTTC CGCGCAGGCA ATATCTTGCG TACGCTGcGC TTTCTACTAT CACGCTTATG 
ATAGAAAACG TTATCGGTCT TTCCATCAGT TTGCCGCGCA AAACAGTGCA CTGGATTATC 
CCTAACCTGG AGGTGAtGGG CATTGAGAAT TTGAGCT t G A AACGGAATCT CATTACGATT 
CTCTCTTCAA AAAG TGTGCG GGGGTGGGAA GTCTATATGG AAAGCGAGAA ACTTTACTAT 
TTTACCCTCA ACATCCTTGG ACAGAAAAAG AAGACGCTCC CAATCCCCTC GGGGAAATGC 
TCAATGCTCG TCGATAAGTT ATAGTGCGAT AAGAAATGTT TTACGGCGCG TGGGTGCTGC 
GCGACGTAcT GCGTTTTCTC CAGTGGCGGA G AAAGTTC TG CTAGCCCTTA GTCCAGAGAA 
GATGGGATGC GGCTGAGGAG CTTAAGAAAG AAATCAAGTT CTGCACGTAC ACGGCAGAGG 
AAATTATATT CTAAATATTC TGCGCGCTCT CCATCGGGAA AATGCAAGTA GAGGTAGTCG 
CTTCTCCGTA TGAGCGCGCG GATAACGTCT TGGTTAGAAG GTGTAGCCGT GGTGAGCATG 
TGCTTGAGCT GGAGcAACgC TGCGCGCTCT GTGGTCAATA GTTCTTTTAC GCGTAGACAA 
AAGTCCTCTG CAGGTTTTTG AGAAAACGCA AAGGATATCT GTGCGTCTTG TGTGTGCAAA 
ATGGGTACAT GCGTTCTATG TCTGCGCGTA CAGCAGTGGG TGATGATCTC ACACGCCTGT 
GCGTGCGAGA CTGTGTGTGA TGTGGTAAGA GCAAGACCTT CTTGGTCTAG ATCGTAAAGG 



28020 
28080 
28140 
28200 
28260 
28295 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
TGCGGGTGCT CAGCGCAGAG GGTACGATAA GCGTGTGCAT AGTGCACAAG GTTGGGGATA 1080

CTTTGTACCG CGTATCTTTT CCCCTTAAGG GAGCGTATCC CTTCTGGGAA CAGCGCAGAG 1140 

TCCGGATATA GGCTATGGAG GCGCGTTGTG CTGTGGTACA GTCTTTGTAC TGGAGCACTT 1200 

CCCCTCGCAT GGGTAAaTCC CTTGTTCCAT GAAAAATiTA AACCGGTGTG AAAAATAGGA 1260

ACTCCCTTTT TTCTCAATCG GAGTGCAAGT TGCAACGCGG cGAGTCCTAC TGATCCACTA 1320 

GGATGAATGG AAAGCGGCAC GACGCCCgCA TTCTGCGCaC GTTTGATAAA TGCgGCATGA 1380 

GTGTACGGGG TGAAAAAGAA GTGTGTAGGC ACGTTCGCTG CACGCACCGC ACGGGGAAAT 1440 

GCGCTTAGAT CCGCAAAGAG CGCAACGGTG CGCGGgAGTG TACCTATGAA TGCTTGTTCA 1500 

ATCCAGAACT GTGATTCTAA TAAGACAACT GCATCTGGAA CAATGTCAGG TAACAGTGCG 1560 

TTGCAc GCCA CATCTACGGC AAGTAAAAAG ATGGTGTCCT TCATGCGGGC ACAAAAAGAA 1620 

CGGCAGGCAT CCAGCGCAGG ACCGGCACCT ACGATAAGGA GTGGTTTGTC TATACTCTGG 1680 

GGAACAAGGT GGTGTATGTG CGAGGGATTT TGTAATTGCG TAATATAATT TGAGAATATA 1740 

TTGCGCGCAT AATTCCTTCC TGAATGGATG AGTGTAATCT TATTGATCCA AAAGGTGTCG 1800 

ATGAGAGTGC GTATGTTTTG CTCGCTTTCG TCATAAAAAT TCCGGTACTG CGCATATGCG 1860 

CCGGaTCCCG CAATTTTCAG TATCTGTTTG AAAGGGAAGC GGGTGAGACG CTCCACGGTA 1920 

TGCAGCACTT GGGTGATGTG TGTGGTGTAT AACACGTACA CATTCTGTGC GGTGATAAGC 1980 

TGGCGCGGAG cgTGcTGCAT AAAAAGGTGC ATAAGCTGCA GGTCACATTC AAGACAGAGG 2040 

AGAAATGAGG AAGGAGGCAT ACGAGTAAGA AGCGCACATA GGCCGTGGCC AAGCACTGGC 2100 

GCGCAACAAA GTACAAGGGT ATGCGGCTTG ACCgCTAAGm GcGCCACGGC ncgCTCaTGC 2160 

GCATCCTGTG CGCGATACTT TGAGTAAAGG TAGGTGTTGC GGTAAAGAAC GGTGAAGCCg 2220 

TTTTGTGTCT TGATAAGACG CGGCGGGAGC GAGGGAACGT CGCCACGCAC GCCAGCGACG 2280 

TCAGATGTAC C CAT AGAAGG GAAACGCACT CACAGCCGCG CCACGCACAG GgCTAGCGCC 2340 

GAAAGATATT GTCAAATGTA TCCTTAAATG AGGGATGAGC CAGGATAGTA TCTACGACCG 2400 

ATTCCTGTGC CCATGGCATC TTTGCCGTGT TTAGCAGCGC AAGCGGGTAG CCCGGGTACC 2460 

AAGTGCGCAC ATGGTGTGCA TAGCTGTCTC CGGTAGTAGC GAGCACTGAT TCGTCCACTG 2520 

ACTTTTCCTC GTGTAACTGG AGCAGTACTA CGGAACTATC AAGAATAAGT GGGTCAGACA 2580 

CCTGCCCGGG GAGAAGGGCG AACGCGGTGG AAAAAAATTT CTCGTCATAT GCAATCCGCG 2640 

CTAACGGGGG GTCGGACTGG CGCGGAAGGG CAGGGAGCAC GTCGACATTT CCGAAATTAA 2700 

TGGGAAAGGA ACGACTCGTA TGCACGTCTA AGTTAAGGCT CTGTGCGGCG GCGGTGAATC 2760 
CACCCTGTTT CGCCCGGGTG GAAAAGGTAT GTGCTTCCTC CTCAAGGAAA CGTTCGATGG 2820

TGCCCCGCTC CACACGAGCC ATGTGTGCGA ACACGCGCTC CCGCGTTGCT GCGTCACTAA 2880 

AGTCCGGAGC GCTAGGTTCA GCGTCGGTGC GCACGATGGC AAAACCGCGC TCTATCTTCA 2940 

CTACAGGACT CAAGGCTCCC ACCGCCgTGC GgAGCACCGT GTCCAAGTCC TGCGCGTCGG 3000 

GAAAGAATTC GTTCACATCA CTCCGATAGG AgTGGGTCAT TTTTCCGGTA GCATCGGTAC 3060 

CAACTTTGGT GGAGCCAGTA GCGACAGCGT CTTCAAAAGA C AA TTCCCGT TTCTCTAGAG 3120 

CGCGTGCCGT GCGGCGCGCG TcCTCTTCCG AGGAGTAGGT GAGCAAGGAA AGGTGATGCA 3180 

GGGTAAAAAG GTGGGCATGC TCTTTCCCGT ACgCGCTGAC GCkTTCAGCG GGAAACCGCT 3240 

CTTCGCCCAA AACGACGTAA CGGAAGCTAC GTTCCTTCTT CGCCATGTCC TGAACAAAGC 3300 

GCAGTTCTCG GCTATTGAGC TTAAGGCCCC CGCGTCCTGT CTCTTTTCCA AAGAGGTGGT 3360 

AAAGGTACTG ATCGGAAAGG AGCGAGTCGC GCATCTTTTT GCGCTGAGAA AGCCGGACAT 3420 

GTTCAGGGGT GCCCTGATAG CGCTGCGGCG AGTAAGTACC GTCAGCGTCA gctAaAAAGG 3480 

AGAGCACCTC CCGATCTAGC AGCTCCTCGC TGAGGGTAAA GCCGCTTTGC TTTGTTTGCT 3540 

CGGTTCCCGC AAGCTGGACA ACCGCGGCGC GAAAGGCAGC ACGTAAGACA CGACGATCCA 3600 

TCCCCTCGCG TTCCTGAGCG TCCTTGGGGT ACAAGTTATA GCGcTCCGCA GTCTGTGCAA 3660

GCGCAGAGTA CTGCTGGGAA AAAAGACTAT CAGGGGCATT GGTGAGCGCG ACCCCTCCCC 3720 

AGGAACCGAG TCGTACGTGG CCGTGCCCTC CCCCGCTGAG GGCAGGGAGA AACACGAACA 3780 

TGAGCGCCGC CACCGCCAAC ACCACGCAGC CGCCCGTCGA GGCAAGCGCC CCCCTGCCGA 3840 

AGTAGAACGA TTTCATGGAc TGCGCAACTG TGGCACAGCG GAACTGCCCC TGTCAACACC 3900 

CGCGCAGaGT GcCTCGGCTC AGATC C AG T A ATCCGTATCA CACGAGGATC AAGCAACTGC 3960 

GTCGGTGTTC TGACGTACCG CGGTGATAAA GCCTGTGCGC CTGCAGAGAT CGAAAAGGGA 4020 

TTGGTGAATG TGCACTTCGT GCAttTTTTG TACCCGTCTC GGTCACGGCA AATGCGGATG 4080 

ATATACCCAT CCTCTGTTTG ACTTACGATG GAGCAATGTG AAGTCTCTCC CGGATCGCTT 4140 

TTGATGCTGT ACTGCTGCAT AAATCCCCCG TCCCGATCTC GGCAGACTGT GGTATTTCGG 4200

TAAGAAATTT TATGCTTTTG TGTGTTCGCT TGCACTGTAT TCCGGGAAAA TGAGAACGCG 4260 

CTGTCTTGCC TCCCTGATAC CGGATATAAT GTTTCCCTCG CGTGGCTTGG ACGCGTGGAG 4320 

GAGGATGTAG GGTATGTCAA TCGTGCTGCA GGGAGTTGCC GCAGtTCTGT GCTTTTTTCC 4380

CTTCTGTGCC CGGTACAGCA ACTGAGCGTT CCTGCGCTGC TTGCTGCTCT TGCAGGAGCG 4440 

GCATTCTCGT GTGTGCTGTG CGTCGCAACG TATGTGCTAA CGGTGCGCCA GCGTGCTGGC 4500

GCTTTTGGCG TCGTGCGTAA ATGCATTGAA TACACACCCT TTGTGCTGAT GGCGTGCTTT 4560

GTCCTCTCCC GCGCGTATGC GCcTACGGTG GCGATGCCGT GGTTAGATTC CCTTTTGGGG 4620

ATGAGTTGGA TGGTGCTGAC TCTGTGTGTC TGTGCGCTGT TGTTTTGCCT GAGGAGGAAG 4680 

TACGTACATC TCTTTTTTCC TCGTGGGGTT TCGGTGCACA CGCCCCCTGC GTCTTCGGAC 4740

GTGCGGAGTG TGTTGCCGGA TATGCCAGTG AGAAGGAGGC GAGGAATCTT TGTCGTACTC 4800 

GAATGGGTTG ACGCGCTCAC CCAGGCTGCG TGTTTCATGC TTTTGGTGAA TTTGTTCGCG 4860 

TTCCAGTTGT ACGTTATCCC GAGCGAATCG ATGGTCCCCA GCTTtATGGT CGGCGATAGA 4920 

CTCCTCGTGT TCAAGACCGC CTCAGGgCCT GTATTCCCGC TTTCTTCGTT TCGTTtGCCA 4980 

CGCTGGCGTA CCTACAAGCG CGGAGACATC GTCGTTTTTT cCAATCCTCA TTAcCCTGAC 5040

ACTCCGCCCC CCTGACGAGC ATCACAAAAA TCGACGCTCA AGTCAGAGGT GGCGAAACCC 5100 

GACAGGACTA TAAAGATACC AGGCGTTTCC CCCTGGAAGC TCCCTCGTGC GCTCTCCTGT 5160 

TCCGACCCTG CCGCTTACCG GTTACCTTGT CCTGCCTTT 5199 
(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12838 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

TCACCCTCTC AAATATCATT CCGCGCGCAC CACATACCCG CAGCACACAC AACTCAACCA 60 

CTCTACCCAT AACCTATACC CCTTGTCAAC CCCCACCACC CGCATAAAAT TCTTTAGAAC 120

TCGCCTTTGT ACCCGCACCA CCCCTATTCA CATACAAACG CTGCCCCGGC AAAATACTCC 180 

CAGGAGGAAT CGTGATACAT ACTCACACGC TCTCGCTGAG CTTCATGCTG TTTTCATTCT 240 

TCTTCGGTGC AGGAAACCTC ATCCTTCCCC CCTTACTGGG AAAACACGCA GGTACGACAC 300 

TCGCCACGGC GTTGCTCGGC TTTGCCACTT CCGCAGTCCT CATACCAATC GCAGGGCTCA 360 

TTACTATCGC ACACGCAGgC GGTATTGTCC CTTTGTCAGA AAGGGTAGGA AAACGCTTCG 420 

CTCACTTGTA TCCGGCTATT ACTCTCCTTG TCATCGGACC GGCGCTTTCT ATCCCACGGG 480 

CAGGAATCGT CCCCTTTGCG CTCGCCATCG CTCCCCTCAT CCATCGGGCd AATACCACAC 540 

TACTTGCGCA stTATATATA CAACATGCTT CTTCATTGTT TCCTACTGGC TCTGCATGCG 600 

CCCACACACC TTAAGCAACA CTCTCGGCAA AGTACTTACC CCCGCGCTCC TAGTACTCGT 660 






WO 98/59034 




198/13041 



TCTCCTCCTC TTCCTTGCCT CCTTCACTCC GACACTCGGT CCCTACCTCC CTGCACAGGG 720

CGCTTACGCT ACCCACATAC CCTTCAGCCA GGGATTCTTA GACGGTTACC TCACCACGGA 780

TGCACTCGCC TCCCTTATGT TCGGCAATAT GATCCTTACC TATTTGCATC GGACCCGCTA 840

CACAACCGCC CCTTTCCCTC CCACTCCAGC AAACACCCCC GCAGATATGC GCACCGTCGC 900

CTGGATAGCA GGGGTCATGC TCTTTTTTAC CTATGGAGTA CTGGCGCATC TCGGCGCACT 960

CAGCGCCCGC CAACTCCCCC ATACCGTTAA CGGCGCGCAC ATACTCGCGT CGGTGTCACG 1020

CCACCTTTTC GGAAAAGCAG GCATCGCACT ACTAGGACTG ATCTTTACAA TTGCCTGCCT 1080

AACTACCTGC GCCGGACTGC TTGTTTGCGT CAsGAATTAC TTCCACAAAC GCGCACCCCG 1140

TGTGTCTTAC CTGTGCTGGA TACGCCTGTT CACCATATCC AGCTTTGCGC TCGCAAATAC 1200

AGGACTAGAA CGTATACTGG gCATACGGAA CACCCCTACT CATGATCCTA TACCCAATCT 1260

CGCTGGTCCT CATTGGCATA TCACACCTCG AGCGACTCAT ACGGATACCA CGCGCCGCCT 1320

ACCGCCTGAC AGTATGGAGC GCAGGAACAC TCAGCACCTG TGCAGTCGGT ACGCCGCTTG 1380

TGGCGCACAC CCGGATAGGA CACGTGTTGA ATACACTcAT ACATACCCTT CCACTCGCAC 1440

AGGAACAGCT CTGCTGGcTT ATCCCCAGCG CGGCAGTTCT TATACTTAGT ACTGCGCATG 1500

CACGCTTACG TGAAAAAACA TGCACGCCTC GCGGTACGCT ACCCcTCACG GATAACTGAC 1560

CACTGGATCT CACCATCTTG TGGAGATGGG GGGAATCGAA CCCCCGTCCT AAAGAGCGAG 1620

TGcTGCGCGC CTACAGGTTT AGCGGTGCGT AcTGCGTTTG TCGGACTCTG CTAGGCCTGC 1680

ACCGCACGCG CCAGAGTCTT AGCACAGACA AAAGTCCCCC TACGTCCGCC GTGCACAACG 1740

TAAGAGCAAG CTCCGTTTGG CGTCGGGCCG ATATGTTCGC TCAGAGCAGC GCAAACACAG 1800

GCcCGCGATT ACGCGGCGAG CGCGTAGTCG AAACTGTCAG AATTGGCAGT TATAAAGGCG 1860

CCGAATCAGG AGATCGACAC TCCACCTGCA GCGCAACACC CCACATCCCT AGTCGAAACC 1920

TAGTCATCCC CCACAGACTG ACGTCTGCCC TTTTCTCTAC ATTCCCCCTC CCTCACCCTG 1980

TGCACCCAAC CTAGGAGGCA CGCTCTTCCA TAATCGCCAC CCCATTGCTG GTACCAATCC 2040

GTGCACAGCC AAGCTCGATA AAACGCTGCG CCTGCGCGCG CGTACGGATG CCGCCTGAGG 2100

CTTTTATCTT TGTCTCACCT TTCAGATATT TTTTAAAGCA CTGAATATCC CCTTCCGTTC 2160

CCCCGCGCGA CGcgTAnCCG GTGGATGTTT TGATAAAATC CGCGTGTCCT GCCTCCACAC 2220

AGGAACACGC AAACGCGATG TGCATCTCAT CTAAGAGGGC GGTTTCCACG ATCACTTTTA 2280

CAATGGCCCC GCGCGCGTGA CACCGCGCTG CAACCTGCGC AATTTCGCGT TCTACAACCT 2340

CTCTTTCTCC CGCGCACACC TTGTCTATAC GGACGACCAT ATCCAACTCT TGCGCCCCAT 2400

CGTCCAACGC ACGctGCGCC TCAGCGCACT TGACCTCCGT GACGTGCGTA CCAAAGGGAA 2460

AACCGATCAC GCTGCACACC CGCACCGCCG TCCCCCGCAC CGCACCTGCT GctAACGCCA 2520

CATGGCAAGG ATTTACACAT ACCGACGCGA AGCGATAGTG TGCGCCTCTT GGCACAGACG 2580

CAACACTTCG GCCTCAGACG CAGAGGGCCT TAAGAGCGTG TGGTCAATAT ATGCATTGAG 2640

TTCCATGACA CTATCCTCCC TGGGACGGAA TGTCAGCCGT ACCTAGATGG GGAAGCGGAC 2700

GCGCCCACGC CTcCGCGGAC AATTGGCCGA TCCGCTCCTC CCCTGCTCCG TCATATGCTG 2760

CAAGCGCAAA AAAATACAAA ACGCCGTTCT GCAGTCCCCG CACCGTATAT GACAAACGCT 2820

TACCCACCCG AATAGGAGAT CCCGCCACAA AATACATCCC TGACGTGTCG CCCACATACA 2880

CCACATACCC CTCTACGTCA AAGTCAACCG AAGgTGTgcC ACGTGAGTAT CACCGACCCG 2940

TCAGCCGCCT GCGCAAAAAG ACGCCCCGGG GGCAAAGGAC GCTCATCTTG CTCATAATCA 3000

ACGGTTACTG CATGCACCAC CGGTGTCTTA CGACCTGCAC CATCTGGATA CAATTGCACA 3060

GCCACCTGGA AGTATCGTCC TTCCAATCCG CTGAGCGGCT GGCCTGCCAC CACCGGCTGC 3120

CACAACGGGT ACTCCAACGT CCAGTCCTCT TTCGTTTGTC CTACTCGCAC GAAGAACGCC 3180

ACATCTGCCT GCTCTGGAAT ATCCACATCT GCATTCACAC GCCGGACCAC CGCCTGCAGT 3240

CCTCCTGCAT CCGTGATCTC CGACTCAAAG CGGCCACCTG CCTGGTCAAA ACGCGCAAGA 3300

CGGTTAAAAC GGCGCAGcAC CTCTCCTGCT TCAGACGGCG GAACAAATTC TTCAGTAATC 3360

ACCACCTCAT CGATCAGACC AGAGTAGCGC TCCCCGATAT GTACTGCGGC TGCAGCGCCT 3420

AAACGCGCAT GCCACACCTG GCCAGTTTCA TCCTGCGAAT CCGTGAGtAC TCAAGACATT 3480

CTGTACGCCC ATTCATGCGA TACTCAAGCA CACCGCGCGT TTCGTCGTAC GTGAGCATAT 3540

GGTGGCTCCA cCGCTCTGGC AGCACGTGCG TACGCGaAcG GaGaCGCAAA GAGACTGCCT 3600

GTCCGCGCAC GTCATTCCAC AAmCCCTCCG CGCGCCAyTC AAGCCGGTGC TGTAAAATGT 3660

GCGCCACAAT ATGCTGATAA AAAGAACGTC CGCGATCAGA AAGAGAAGAA CGCCACCGGa 3720

ACAACACCGC CCCGTTCTCA CTCACCGCAG gATaCAGCCA AAACTCAATG GAGAACGAAG 3780

ACAAGGCCTG cGACCCATAA AAAAGAGCGC CCGGATTTGG CTGTAGCACT ACCCCTTCTG 3840

CACCATCTCC CCCCGCGGCC CCCCCATATG CAGATGGCAC GGAGCGGTGC ATCGTGCGAA 3900

ACAACGcCGC CCCCsCTCCG CGATGCGCAC GCTCTTCGCC CACATGtGCG CAGAGGAGGA 3960

CTGCACACGA TAACGACCGT ACAAATCGCT GACCAACGGA TcATCAAAAC TAAGATACAA 4020

GTCACCCCGA ACACTGCGGG TACGcGCAgC AGaAGAAAGT TCAAGCGCCG GATGGCCCGC 4080

TCGCCCCACA CGCGTACGCA GGTTTTTTAC CCGTGTAAGC GACTGCCACC CcTGCGCACC 4140

CCCGAGCAgC AGTGCGCTTC CTTTGCATAC AGCACACCCG CaGCATTCCC CTCCCCsynC 4200

aCACGCCGCA CCATACCCCA ACAAGGGCGA ATATTGTCGC AAAAAGCGCA CCCATCGCCG 4260

CCGAGTATCG GCCTTTCCAC ACGAATAATC AATAATGAGC GCTCACACAG CTCCCACACC 4320

GTACAGAACC AGAGCTGCAC GCAAAGCCcG CGCAGCgctT TGTATGTACG CGCACGGTAC 4380

ACCCCAGGGA AAAAGAGCGT CTCCTCAGCA GACAGCGAAA ACGAATGGGC ACTCCGCTAC 4440

AACGGTTCAG CCTAAGGGCA AACCATCACC CTTACCCCGG CGCATGGTCA CGTAGCGGTT 4500

CAAATCCGTC ATATTTTTAA CTACTATCCG GCGCTCTTGC CACTCAACCT TCCGCTGATC 4560

AGAAAGCCTG CGCAGCGCGT CCTGTACCTC CCTGTCAGAA AGACCAGCCC ACCGGGCTAT 4620

CTCTTCGATG CTGATATCAA AAGAACGCGC GTCGCTGCTG CGATCCACGT GCGGTTGCGT 4680

TTCATCCAAC ATCAAGAAAA CATCCCCCAC ACGCGCAGTA CTATCTTGAA TCGTTAAAAT 4740

CATAAAACGC CGCTTCTGGG TGTAAATACG CCGCACAAAC GTTTTCAAAA GCCGCATTGC 4800

AATAGCAGGA TTACCCATCA TGAGCACTTC GAAATTCTCC CGATTGAACT CCAGCGCCAC 4860

AACATCGTCG TACGCAACCG CAGACGCCGA ACGCGGTGAG TTGTCGAGAA TAGACATCTC 4920

TCCAAAAATC TCCCCCGGTT GCAACACATC CAAGTAGCGC TCCTTTCCGT TGATAATTTT 4980

AATTAGACGC ACCCGCCCGC TCTGCACGAG GTAGAAACTC TCCCCCACAT CAAACTCTGC 5040

GAAGATAACA GAACCCCGCT GAAACTTTTT GGCAAAGCGC GTAAACGAGG CAAAGGCATC 5100

AGCCATGCGA CGCCTCCTCA CAGGAACGCT GGAGCTCTTT TATGCGCGGA ACTAAAGACT 5160

CAGGAGCGGT AGAAAGCGCC TTGTCGTAAA AAGAAATCGC CTTATCCGGC CTCCCCATAC 5220

CCTGATAGCA CTGTCCCAGA TACATCAGCA CCTCCGCAAG ACGCGTAGAC TTCGGATTGC 5280

GGGTAATGCA CTCAGTGAAC GTCTGAATGC TCCGCACAAA CTCCCTCTGC TCAAAAAGAC 5340

ACCGCCCCGC ACCCAGATAC GCAGCCTCCG CGCCCACTCC CCCGTTCGAC GCGCACCCTA 5400

ACCGGTAATG CTCATAGGCC TCACCCCACT TTCCCTGTTG CTCAAGAAGC TCTGCGGCAC 5460

GCAktCCCCC ACTTCAGAAT CGACAGCTGA CTCAGCAAAA GCAGAAGGCA CCTCAAAACC 5520

CGCATCCGAA CcTTGCGCAC CCGCCTCCTC AAAGCCGCGC CCGACTGTAC CGTCGGCGCT 5580

CTCAAGCATC GACGCAATAT CGTGCCGATG CTTTCCGTCC GGATACAGTT CCCGGTAACG 5640

CTGCGCCACC TGACTTGCAG CAAGGTAATG CTCAGAGGCG TGAAAGGCAC GGGCAACGGT 5700

GTACAAACCC TCCTCGTTAT TTGTCTCCTC CTGAGAGTCA AGGAGCGACT CAAGCTGCCG 5760

ATGCACGCTC CGCAGTTGGC GAGAGAACAC CTTGAGCATC TTCATAACAA TGCGAATGTT 5820

CGTTTGAGCA AAGGCTTCAA ACTCCTGACT CCCAAACGCA TACACAATCG AATCCACCAA 5880

GGTGATCGCA TTCTCTTCAC GAGGAAAATT ACCGAGCGCA GACTTAACCC CAAAGAACTC 5940

ACCAGTTTTA ATGTACTCAG TCACCTGCGA CCCTGTTTCG ACATCCGCAA AGGTAAGCGC 6000 

CACGTGCCCC TTATTCAGGA TCATAACACG ATCGTCAAGA TCGCCTGAAA AATAGATAAC 6060 

CGAATTGGCC TTATACTGAA TGGCTTTTGG CACGTCTGCC TCCGTCTTCC ACGCGACACC 6120 

GAGAAAAAGC GTGGCTCCCA CGGTACATTT TCGATAGAAC GGTCATGCAC TTAAGTCTTT 6180 

TTCCAGAATT CACGTCACGC GCTCCTGCGg CnACACCTGC AGAGATCGTT CGCACGCCTT 6240 

TTCCAGGAAA GGTGGGTGTG CTACTATCGG TATGTGCAAG TTCACTCTAT CTGGAGGCGC 6300 

TTCTGTGCGC TCGGCCTGCT GGTGCCCTTT CTGCTTCTGC TGTTTTCTTG CACCAACACG 6360 

GTTGGCTACG GCGTCCTCCA GTGGTCCCTC CCAGATCTGG GACTGAGTAC AGGAGACATC 6420 

CTGCCGGTGT ACGTGCGCTC AAACGTCTCC CAAGTGTACA TTGTGGAAAT CCAGAAGAAA 6480 

AAGGTAGAGC TGCCTTTCTG GCAGCTAAAA TTATGCAGGA CAAAGAAAGA GGCGCTTCAG 6540 

TACGCTGAGC GCCTCCGCGA GTACCGTTAC AGtACGCCAC CTCTGTGCTC GACGGTCTGC 6600

CCCTGCGAGA AGGGCCTGAG AACACTGCCC CCCAAGTTTA TCGCCTCCGC GAGGGACAGG 6660 

CGGTCAAGCT ATTGTGGAAG GGTACAGGGA AGGCCGTCTA CCGCGGTGAA AATCGCCTCG 6720 

AAGGGGATTG GTTCAAGGTC ATGACCGAAG ACGGTACCAC CGGATGGTGT TTTTCTCACG 6780 

GTCTATCCCT CTTTGATGAG CGCGAGTCGC GTCCTACAGT ACGAGAAACG GACGATCTCG 6840 

CACGTGATCG CGACCTTCAG CACGTACTCA ACTCTGCGTG GTATCCTGAA TACTACCGCA 6900 

CCATGGTTGA ACAGCGCCGC ATCGACTTAG AAAAAATGGC AAGCGGCTGG GGTTTATTTG 6960 

TCGGTGAGAA AAAAGGCCTC GCACGCATTG AATTGCCCGA TGCGCAcTAC GCCTTTCCCT 7020 

ACTCCCGTCT GGTAAAAACC GGATCCAACG GGTACCTCTT TGACGGATCC TCTCTGAGCA 7080

TCTATGTTCG GGACGCGCAC ACCCTTGCCG CGCAGTTCAC TGACGAAGCT GGGCGCcTGC 7140 

GCATAGAACG CTTCGTCACC CTGGAGAAAA CGCCTGAAGA GATTATCGCA GAAGAGCAGC 7200 

TGCGGCGCAG TGCGCTTTTG GAACACGTCT GCACACCAGG ctGCGCCTTC ACTCTGAGAT 7260 

ATATGGGACG CTGTCTTTTA CAGAACGCAA CGTCTTCACC TGGACAGGGG CGCGCGCGCT 7320

GTCCCCGGCG CTTATCCCCG CAGGGGCAGG GAGCACGGGG CGTGTAGCAC TGCGGTGCTT 7380 

CATAGATCAA TCGCTGAAAA GCGAGTATGA AGGGGTGCTG TCCTTCGACT TTGaCAGCGC 7440 

GCAGGAATGG GTGCACTTCC TGTACTTACG CACCCCCGGG GGGCTAAAGC TCGAACACAT 7500

AGACTCCACC CACcTGAAGG ATGCGACAGT GTCCGCAGGA GCGTAAGCCC AGTGGTACTC 7560 

TAtTCGCGCC GGAAGGACAC GCCGAGCCCC AACCCTAAGA AGCAGTCAGC CGTCGGGAAG 7620

GGAGACGgCC GCAGGCCCCT TTTGGGAGGA CGAGCGAGCC CTGTGTGCCC GGCTGCCCGC 7680

CAGCTGCCAC CCCGGCGCGG ATTTGAGCAA ACTGGCGCGC AGTGACTCCT AGCGCGGCAT 7740

GTCCCGAGGA TCGGTCTTGC CGATTTTTTC CCCGAGTTTG TTCATAAACT CAGCTAGCTT 7800

CATCTTACCC TCAGAACGAT AGATACGCTC TACCTGGGCG AATGCCTCAA GCCCTTGCGG 7860

ACATGATCCG GATCAGCCAT AAACTCGTCA AAGTTCGGAA TAACCAACTT CATACCTGGC 7920

CTAATGCGGT CTGGATGCGT TACCACGCCC TCACTACAGG CCATAATAAT GGGGAAATAG 7980

TATCCGCGGA TGCGCGAGCC GTAAAACTTT TTAGCAATTT GAGAAAGGGT GTCTTCGTTT 8040

TTGACCACGT ACTCAGCACG CGACACCTCT TGCGGAGgAC TGCCATAGGC TCcTGCGGCT 8100

CCTCGGGGAC GGGCGTCATG GGCTCTTCTT CCTGCTCCTC CACAGGCGGC GCAACTTCCA 8160

CGAGCTCCTC CGCCGGAGcc AGACTTGCAG GACGTAACAC CCGAGATAAA CGCAATCCAC 8220

AGCAACAGGC CCGTACTAAG CTTCCTCATC ATCGTCTCCT CCTCCAGACT GCACGCACAA 8280

TCCGGACAAA ATAGCGGCCC ATTTTCACAC AGAACCCCAA TGAAGACAAG ATACATGCCA 8340

CCTCATGGGG AAAATCATAC GGCATACTCT TAGCAAGCGC ACACACCGTC CTTCCCACCT 8400

CTGCCGGGAG CAGCGTGCTC TCTTTAGGTC CATTTCATCG GAAAGGATGG CACTGCAATT 8460

CATTAAGGAT ACGCACACAA TCAACCGAAA CAGTGACAAA CGTGGGTGCG CGTAgyTTTC 8520

GTTCCGCCGT TTTTACTCTT CTCACCACTG TCTGTGCGCT GTGCGGCGAC CCAGCGCATG 8580

CCTATCTTGT GCAATTTAAA GAGCAATTTT ACCGTCTGTA CCATACACAC CTGCACCAGT 8640

ATCCGGACGA AGTTATTGAA AATATTCATT GGCTTGAGCG CGCTGTACAC GCCGACTTCG 8700

CCAATCCCCT TTATGCACTT GCGCCTATTC GCGATAAAAA AAGCTGGGAA AAATACCGCG 8760

CACTCTTCAT GATGCACTTA AACCTCAAGC TGACGGAGCA ACATTTGCGT CTTGGCGAAA 8820

AATTTGACAA AAGGGAAGCC CTCTTTTTCA ACGCACCATG GAGAGAAGAG AATATCGAAA 8880

GCCTCGCTAC nGCCGAGCGG TGCTACCACA CTGCACGGCG ATACTGGCAC GAAGCGGCGC 8940

TCTGGGCCGA GCGCGCAAAC GCCGATCAGT TTCGTTTTCT GTTCCTCACC GAGTTGCCTG 9000

CTTGGGAAGA CGAGCGAGAA cGCATCGCGC GCGGCACGyt CAACTATGCG CGTACCATAT 9060

CGCGCGAGtC GCACGTCTCG AGGCGGTGCG TGCTCgCTTC ATGGAGATGA ACGACACCTA 9120

CTGAGAACGA ACTCCACCGG TACCAGGCGA CACTGAACAC TTACATCGCC TGGACTACAC 9180

AGAAGCCGGT CAATAGCGGG CGGGTTTGCC CAGAAACTTt ACCCCAAGAT CAGGAAAATC 9240

CGTGAGCACC ACX5TTTGCGC CCGTCTGTTT GAACAAAATG GAAAACATCT CGTCCaTGGT 9300

GCGCGCGTAG CTAGGCAGTG TTTCTTTCgT ACCGTGTGCA CATGACATTC CAATTTCGCA 9360

TCTTGGATTG CAGAAACCAT CGGACTCAGG CGAACAGCGC CCACCTTCGA CCATTCATTC 9420

TCTATGAGCA TCCTCCAGTC AGGACCCACG CCGTCTGCAT ATTTTGCTAT TTTCTGCATA 9480

CCACCGGGCT CAAACATCCA ATTGTAATTG TAGTTTATCC ATTTCCCACG CGAGTCCTTC 9540

TCCTGTGTTT CACGTTGATC TGTGTAAGCA ACACGCTGAA TCAGCTTCAC GTTCATTTCG 9600

TACTTTGGTA AAAGTTCTCG TTTGATACGC TTCAGCTCGT TAAAATCATA CGTTTGCACA 9660

TACACTAGAT CCGATCGACT TTGGTAACCG TATTTTTTCA ACAGAGCGAG GGTAAGCGCT 9720

GCGATGTCTT TTCCTTCCTG ATGATGAAAC CACGGCACCT TTATTTCAGA GTAAATTCCA 9780

ATCTTTTTCC CGGTTGTCTG TTCCAACCCA CGGATAAACT GCAACTCCTC TTCAAAAGTG 9840

TGCAGCCTAA AACCAGGCTT CCAAAGAGGA AAGCGCTGGC CATACACCGG CGTATGTCGC 9900

TTACCGCGCG TATAGAAACT ATTGGTTGCA CGGAGGAGGG AAAGTTCTTC TACCGTAAAA 9960

TCTATGACAT AGAAATGCCC ATCCGCACGC TGCCGGCGTG GAAATTTTTC TGCCACGTCA 10020

GTCATATTAT CCAGAATATG GCTTTGCGCT ACGATAAGCT GATTATCCTT TGAAAGCACG 10080

ACATCCTGCT GCAGGTAATC TGCTCCTtGT GCAAAAGCAA GAACTTTCGA GGCAAAGGTG 10140

TGCTCGGGCA CATATCCTGC AGCGCCCCGA TACGCAACTA TCATACGTTC GGACGCACAG 10200

CCTGCAACCA AkGCCGCAAA CACCCCCCCC CAAAGCGTCA CACAATATGT TCCCCGCATA 10260

AACTTCTCTC CTCCCCTGTT ATAGAATGCA CAGATCGTCA CCGTGCATAG TAGCACGCCG 10320

CATATCTCCA CTGAGCGCTG ATCCTACCTT CAGTTTTAAT CCCATTCACG ACATGTCTTG 10380

CATACAGGGT AACGGGTACG CCCACACACG CGCTTGAGAA TGGGGAAAAA AGTCTTCAAG 10440

TAAGAAAAAA AAGAAGCAAG AGCCAATACC ACTGGCACCA CGTACACCAG GCGGCCAACC 10500

GCACGCATGC GCTCATACCA ATCCGCGCCA GCCAATTCAA AGGCGTACAA TGCTTTCAAA 10560

AGAAGCGAAA AAAGGACCGC ACCCATATAC GAGGCTGTTT TCAGCTTTCC CATTCTCTGG 10620

GCACCAACTA CGTGACCTTC GCCACACGCA AGCATTCTCA GGAACATCAT TCCAAATTCC 10680

CGATACAAAA TACACAGAAA AAGAAAAACC GGCATGAAGT TGTCTGCCAC GAGGCAAAGC 10740

ATGACAGTTA CATTCGCTAT CACATCAGCA AAGGGATCGA AAACCTTCCC AAAACTAGAA 10800

TACTTCCCTG ACTTGCGCGC GTAATAACCG TCGAGGAAAT CAGTGCACGC AATGAAAAGA 10860

AAAAGCAGCA CCGACGCGAT AGACACCACA CGCCCCACAT TGGcAGCAGG AAAGTACATA 10920

ACAACCCAAC GTGACATATG ATAGAGTGCA AAGAAAGGGA GAACCAACGC CAGTCTCAGC 10980

GCAGTATAAA AGTCAGAAAG CCTCATACTC ACCTAGGGGA AAGACGTAAT GGTACACCAC 11040

AGGTGTACTc CTAGCCTCGA ATTTTGTATC GCCTATTGAA CCTGTCAATG CGTCCTGCTG 11100

AATCTACCAG TTTCTGTTTA CCGGTAAAAA ACGGATGGCA CGCAGAACAA ATCTCTACCC 11160

GCAAGTCCTT CACGGTAGAA GCAGTCACGA TGACGTTACC ACACGCACAC ACCACCTTCG 11220

TCTCCTCGTA CCGAGGATGC AGTCCCTTTT TCATCTGAAT CGCTCCTTTC CCCATTCTAC 11280

AGcGCGCCGC GCCCTTAAAA AGGTTACATC CACCCTCTTC CAGAAGGCTC GTACACGCCG 11340

GGTGTAACCG ACACACGTTC CAGTCTCCTT ACTCAGGAGG AGAACGCACC CGTATTCATA 11400

GACCTCAAAA ATGCCTCGTT GTTCTTTGTT TTACGCATCT TATCAATTAA CAATTCCACA 11460

ATTTCTGCAT CGTCCATAGG ATTGATTACC TTACGCAACA CCCAAATACG CTGCATTTCT 11520

TCCTCCGTCA GGrGCAACTC TTCCTTACGC GTACCAGACT TTTTAATACT CACCGCGGGA 11580

AATAGGCGCC GATCCGAAAG GCGACGATCG AGATTTATCT CCATATTCCC CGTACCTTTA 11640

AACTCCTCAA AAATAACCTC ATCCATCCTA CTGCCTGTTT CAATAAGCGC AGTGGcAATG 11700

ATTGTCAGAC TTCCTCCTTC CTCCACATTG CGAGCTGCAC CAAAGAAGCG TTTCGGTTTG 11760

TGCAGAGCAT TTGAATCCAC TCCCCCCGAC AACACTTTAC CTGAAGTTGG CATCGTTTGG 11820

TTATAGGCAc GCGCCAGACG CGTAATCGAG TCAAGCAAAA TCACCACGTC CTTCCGGTGC 11880

TCTACCAATC GCTTTGCGCG CTCAAGCACT ATCTCTGCAA TCTGTACATG GCGAGTAGCC 11940

TGTTCATCGA ACGTAGAAGA AATAACTTCA GCATCAACCG TACGCTCCAT GTCGGTTACC 12000

TCTTCAGGAC GCTCATCGAT GAGCAGCACG ATAAGATAAA CTTCAGGATG ATTTTGCGTG 12060

ATGGCATTTG CAATTTTCTG CATGAGAATC GTCTTTCCCG TACGCGGCGG CGCTACAATC 12120

AGCGCACGCT GCCCCTTTCC GATCGGACAG AACAaGTTCA TGACACGCGT TGAGATATCT 12180

TCCGTTCTTG TTTCTAAATT CAGCTTTTCC CGCGGGTACA AAGGGGTAAG ACTGTCGAAA 12240

GGGACACGGT CCTGTACCTT TGCTACTTCT TCGAAGTTTA CCGTTTCCAC GCGGAgcATT 12300

GCAAAGAAAC GCTCTCCCTC CTTAGGGGAG CGAATCTGCC CATAGATGGT GTCGCCCGTT 12360

TTCAGATTAA ACAGGCGAAT CTGACTTGGA GAGACGTAGA TGTCATCGGA ACCGGGCAAA 12420

TAACTGTTCT GAGGTGAACG CAAAAAACCA TACCCGTCAG GCAATATCTC CAGCGAGCCA 12480

GAAGCAAAGA TAACGCCACC ATTCTCAGTG TGATTTTTAA GAACGTGAAA GATGATATCT 12540

GACTTTTTCA TGACAACCAC GTCTTCTTGA GAGATACCCC GCTGTACTGC AAAATCACGC 12600

AGGgCATGCA TCCCCATCTC AGTTAAATCA TCAATCAGCA AACGCGCCCT ACCCTTAACG 12660

TCGGCGGAAG ACTCACTTGT TTCCGCATCT TCTGGGCAGA AATTTTGCTT AAAGCGTAGG 12720

GCTCTTCGCG GACGTTTTTC CACCTCCAGC GCTTTTGGCG TCACCACTCG TCTCCGTGAG 12780

CGAGAGsGgA TGACGAGCTT CTTCCTCCCG CCGTAGATCG CATTCCCCCA CGTCAGCG 12838
(2) INFORMATION FOR SEQ ID NO: 11:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17378 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

TGCGCGTGTG CCACGCACAC CAGTACGTAT GCTCGAGCAC AGTGGTCACG TGATCACTGA 60 

TGACGTGGAG CGGGAGCAGG TTGCCTCTTG TGTCAGTGCT TTTTTACGCA CGTAnTtTAC 120 

GTGATGTCTA CCCAAAAGGG AGTGCGGTGC ACGngTGCTC CACATTTCTA GTTGCTGTAG 180 

GGGGAAGAGA CTGAGTCGCT GTTTCGGACC GCGTGCGTTT TATGTGCGCG CGCTGTGCAT 240 

CGCGCTCCCC GTGATGCTGC ACTCCTTCAT CCAGACGGGT ATTTCTTTTT TAGACAACGT 300 

TATGGTCTCC CGTTTGGGGG ATGTGAAGAT GGGTGCAGTG AATGTGGTCA ACTCGCTGCT 360 

CTTTCTGTAT GTCACCGCGT TAATGACCGT GTCGAATGCA GGCAGCGTGT TTATGACGCA 420 

tACTCAGGAG CCCGTCACGT AkGGGCATGC GGCAAAGCTA CCGATTTAAA CAGTACGCCA 480 

TGGGGTCTCT GGCGCTGGGT GCTATGGCCG CTGCGCTGTG CTGTCCTCAG TATCTCCTTT 540 

CGTGTTTGTT GGGAAAAAAT GCGCAGGCTG CTCAGATTAT AGCGGAAGgT GAGCGTTACC 600 

TTTCGATAAT TGTGTACACT CTTGTGCCGC TGTCATTTTC TTTGGTCCTC ACCTCTACAT 660 

TGCGAGAAAC AGGGAAGGTG CTTGTACCGC TTGCAGTGTA CGGGTGCAGT GCCGTATTGA 720 

ACGCAtGrGT aATaTATGTT GATTTATGGA AACTGGGGGG CTCCGCGATT AGAAGTGCAA 780

GGTGCAGCAT GtGCAACGCT TATAGCGCGG GTGGTAGAAA GTCTTATGCT CCTGGTGTAT 840 

GTGCGGGTTA AAAAACCGGA CTTTTATGTG CGGCTTTTTT GTCcTGTGCG ATACCCCTGT 900 

CACTGTGTAC GGTGATGCTG AGAAAATCGC TGTGGATTTT CGTAGGAGAC ATGGCATGGT 960 

CGGTAACGGA GATGGCCGTG GCTGCCTTGT ATCACAGCCG TGGTGGGGCT GAGGTTGTGG 1020 

CAGGGATGTC GGCGGGGTGG ACACTCGCGC AATTATTTTT TCTATCATTC CCTGCAAGTA 1080 

GCGTGGCAAT TACCATTTTG GTCGGGGATG TGTTAGGGAA AAGCGAGCTA AAGCAGGCGC 1140 

AGGATTATGC cACGGTGGTT GATGAACGGA GCGTTCTTTT TAgGGTTAGG TTTGGGTGTG 1200 

AtTGTGTGTG TAGCGCGTGC aGGGATTCCG TGGGCTTTTG GAGATTTGTC GCaTGcTTCG 1260

CAACGTATAG CACAGCAGTT GGTGCTCGTG ACGGCGCTGT ATATGCCGAT TTGGATGTAT 1320 

TTAAATGCGC AGTATGCGGT GGCACGTGCA GGAGGTGAAG TGATGGTCAC CGCGTGGACA 1380

GAAACGTTGG TAGATACCCT GTTGTTTTTG CCGTTGATGT ATGTGTTGGC GCGCTTCACT 1440

CAGCTTGGTG CGCCGCTTAT GTATGGAATA GTAAAGAGTA CAAGTGTAGT AAAAATGGTG 1500 

GTGCTTGCAC GTCACTTAAA AACACGTCGT TGGGTGCGTA ATCTCGTGGC GAATTTATCG 1560

TGATACGTGT TACGTCCTTC GGGTAAGGCA CGGAGCATGG AGTGACGGAA ATCGAAGCAT 1620

GTATAAAAAC TGTCCGCACG CCCTATAGGC GTCTTTTTGT TTTCCCTTCG CGTATTGTTG 1680 

CCCAGGGGTG GCTGCGGCAA AGTCTTTCTC TGCTTGGTGT GCGTACGGTT CCGGGGCGGT 1740 

TGTGTCTTTC TTGGGACGAG TTTAAAAAAC GATGTTTTCA ATGTGCGCCG TGTGCGCACC 1800 

GTACACCTAT TTCCGAACCG CTTCGTCTTT TATTTGCACA CTCTGTGGTG CAGCGCAATG 1860

CGCGGCAGGC TGCAGAGGGG CGCGCACTTT TTTGCAACCT TATTCCTCCG GCATATGCGC 1920 

AAGACGGTGC AGTGTTTGTG CAGTGGCTTG CCCGTATACT CCCTCAACTT GGATCGTGGC 1980 

AACGGCGCGT TGAATCGCAC TGCATGCCTC CAAAAGATGC GGTATCGCGT AATACGTTTG 2040 

ATGGyCsCGA CGCGCgCGCG TATGCcAGAg TGCGGAGGCG CAAGATTTAC AAACGCTGAA 2100 

AGgCAcTATG AGCAATTTCT CCGCGCGCAT GCACTCTTTG AACCTTCCTG GGATACGCCG 2160 

CAGTTTTGTG CGCAGGGGAA CACGTATGTC ATTGTATACC CGCAGCTGAT GCAAGACTTT 2220 

GCAGAGTATG CACCGGTATT GCAAGAAGCG GCGCGCGCCA CTGCGGGAGT ACTCACCTTT 2280 

CTTCCGGTTC CTCCCTTTCG GCAGGATACG CCGTTGTGTT GTTTTTCGAA TGTACGTGAG 2340 

GAAATTACTG CCGTTGCGCT CCAGGTAGAG AGGTTGTTGC GCACGGGAAC GCCTGTGTCG 2400 

CAGATAGCAG TTTCGGTGGC AAATTTAGAA GAACTGCAGC CATATGTGGA GCGTGAGTTT 2460

CGTCTGCGTG ACATTGAGCC TGAGGTGCGG GCAGGTTTTT GTCTTGGTGC TCATCCGGCA 2520 

GGGAGGATGT TTTCCCAGCT TCGAGAGTTT GTGCGCAGTC ATGgCACGCT CAAAAGCGTG 2580 

CGGGcGCTGC TTTTGAATCC GCATATTCGC TGGGCGGACC CCCAAGGGGC ACAGGCTGTG 2640 

GTGCAGTACG GATTGCAGCA GGCGTGCATT CGTTCATGGA AGCAGAGCGG CACGTATTGC 2700 

AACGTGTGGC TCCAGGCATT TGCGCTCCAC TGTGAGCGCA CAGAACAGGA GCGACAGCAC 2760 

CAGCAGTGTG CGCAgcGATT TTTTCTGACG CTGTTACGTT TTGCGCGCGC GTTGGTTGAG 2820 

GCGCGTAgTT TCGTACGGAT GCAAAAGGCc TACGGTGCGT TTCGTGCCGC TTGTTTGCTC 2880 

CCCGCGTCAG CraCACGTCT GGTGCAGAAG AGGAGATAGC ATCGTCTGCG TTTGCGTCTT 2940 

GCAGTGCTGG GGAAGACGAT GCGGTGATGG CGCGGTGCGT CTGTGTTTTG CAGGAGTTAG 3000 

CGGCGCTTGA GCGGCGCTTT GCACACGTGG TGCCACCGGA TCCATATAGT TTTTTTGTAC 3060

AGCAATTGGC ACAGCAGATG TATGTACCGG TGCGTGCAGG GGTAGGACTG GCGATTTTTC 3120

CCTATCGGGT TGCGGcGctG CGCCCTTTTT GCATCACTTT GTGATAAACG TGTCGCACGA 3180

AGCGAGTAGC GTGCGGTATC AGCGAGGTAC CTTTTTGCGC GCGGATGTGC GTGCGGCATT 3240 

TGGTTTTGAA GATGAAGACG TAACAGAGGC TTTCTTATCT GCGTACGCGA CGGCACAGAC 3300 

GGTGTATTTT TCCTGTTCTG TGCAGGCGTT TTCAGGGGTG CAGCGCCCgA ATCGTTTTTT 3360 

TTCAAATGTG CATCCGCCTG TGTCCTCCAC CCTGACGGGG AGGGCAAGTC AAACACCGTC 3420 

CCCAGCGGGC ATTGCATCTG AAACTGGTGG CGCGGTGCCT CAGTATCCTG CGGTGCGCTA 3480 

CGAGGAAGAC GCGCTGCAGG CAGAGCAGGA CCTGTACGCA CAGGGCGCAC CGGTGCCTTC 3540 

GTCATTGTAC AGAACACAAC AGGAGCGTTT ACGGAAAGCG GCGTCTCTTA TCCCTGCAGC 3600 

GGGGCGTTCG TACATACGTG ACTCCTTTGC GCAGGCGCTC CCACCGCTGA CTGCAGTACT 3660

CCATGCGCGT CATTTTCATC ATGCGGCGGT CAAGGTGAGT CAAACCGATC TTAATCTTTT 3720 

TTTTCGGTGT CCGGCTGCTT GGTTTCTTGA GCGTGTGTTG GACGTGGCGC CGCTTTCTCG 3780 

AAGGCCGCGT TTAGTGGATC CGCGCGTGTT GGGGGTTTTT AGTCATGTAG TACTCGAGCG 3840 

GCTGTACAAT AGGATTGCGT GCGAGGACGA GTGTTTTTTT TCTGCGCACA TGGAACGCTA 3900 

CCGTTTATGG ACGCAAGAAG CGATTGAGCA AGTCTTTTCT GAGCGTGCGG TGCGTGCCGG 3960 

TCCGCTTGTG TGGGCGTTGC GCGCGGCCTG AGCGCGCGCA TCCGGCACAT GGTGGAGTTT 4020 

GTATTGCAGT TTGATGCGCA GCGGCTTGAC GGcTGGCGCG TGGTACGTAC TGAGAAAGCG 4080 

TTTGAGTTTA CCGATACGCA gTGTTTCTAC ACGGGGTTGG TGGATCGCAT TTCGTGCAGT 4140 

CCGGACGCGC GGTCGCTTGc AGTGTTAGAT TACAAGACCG GGGCGCTCCC TGCGCTTTCT 4200 

GATTACACAG ATTGTGAAAA GAAAGGTCGG TTGTCTGATT TTCAAATACC TATGTATGTG 4260 

TATCTGTTGG AGCAGGCGGG GTATACCGTG ACGCACGCTT TTTTTTTAGA CGTGAGAAAG 4320 

AGAGATTTTA AGGTTATTGT TTCGAACGGG CGGGTGGATA TGGGTGCAAA GCGTGGGGTA 4380 

GATACCGTAC AATTTCAGGC GGTTATGCAG CGCTTTGAGC AGTCAGTGGC AGTTTTTTCG 4440 

AAGGCGGTGC GCCAGGAATG CTTTGCCAAA GCGCCGTATG TCACCTGGCT TGAATGTGCC 4500 

TCGTGTCGTT TTGCGCCGGT GTGTCGTACC TCGTATGTGG TGCGTGGGGC GTCGTGACTG 4560 

ATTTTCTTTT TTCTTTTTTT CAAAGTTTGA ATGCAGAGCA GCGGCGTGCA GTTTTTTCTT 4620 

CGCATAATGC AGTTGTTACC GCAGtGCAGG TTCGGGTAAG ACGAAGgTTA TTAGCGCGCG 4680

GTATATACAC CTGGTTGTGG AGCGGGCAAT TCCGGTTGAA CGGATTGTGG TGCTCACCTT 4740 

TACCAGAAAG GCGGCCATGG AAATGGCGCG CAGAATTTAT GAGGACCTCC GTCTGTGTGT 4800 

ACAGAGTGCG TCTGCGCAgc CGGAGCCGGG GCACGAAGCG TATCTGCTGC GTGCGCGTGA 4860

GGCGCTTGCg cGGTTTGGGG AAGCGCGCAT TATGACGTTA GATGCCTTTT CGCACGAAAT 4920

TGCGCGGGTA GGCGCGCGCT TTTTCGGTAT CGCGCCTGAT TTTTCTCTCA GTGAGGAAGA 4980 

GAACCGCGCG CTGGCACACG AGTGTGCAGA AGATTTTTTT CTTGAGCATC GGGAACATCC 5040 

AGTGGTACTG CATTTTTTGC AGCAGGAGCA CGCCGAAGAC TGCGTGCGAG AACTTTTTTT 5100 

TATTCCCTTG CAGGATCACG GCATACTTAC ACATCCCTGT GACTTTCGTG CAGGGCTTGC 5160 

GCATCAAATT GCTACAGCGC GTGGGTTATT AAAAACGGTG CTCTGCGATA TACACGCAGC 5220 

ATTGCACGCC ATTCGGCACC ATATGCaAGA GGCAGATGCG CAGAATGCgc TnCATTGCGC 5280

GCTGCGTTGC GCTGTTTGCG GCACAGGATA CTGCCTTTTC CTACACGCCG GCTGCAGAAG 5340 

CAGATGCGAT TGCCGACGCG TTTTTGGCAC GTGGGTACGA GGAATATGCA GCTAAACCTG 5400 

ATGAGTTTTC TGTGTCTGAC CCTGATGAGG GAGCGcGGCG CcTGcACACG ATTGCcTGCG 5460 

GTATTGTGCG GCGGTAAAAA CGCTTTTTTG TCTGAAGGGT AATTTAGGGG GGCGCGCGGG 5520 

TGCAGCACAG GCGATAAAAG CACAGGTAAA GCAGCTGCGT CTTCAACTTG TACCGCAAAT 5580 

GGAACGGCTG CACGCGTTTT TTGCGCAGGT ACCGTTCCTT GTGGCACTCA GCTCGTTGCT 5640 

CGAGCTTCTG CAGGCGCGTT TTATCCGGCA AAAACGGGAA CGGAATTGTC TCAGTCACGC 5700 

CGATGTGGCG CATCTTGCGG TGCAGGTGTT ACGTCAGTAT CCGGAAATAC GCGTTTCTTA 5760 

CAAGCGGGGT ATCGATGCGT TCATGATTGA CGAGTTTCAA GATAACAATG CCCTCCAGAA 5820 

GGAACTTCTT TTTTTTCTTG CCGAGCACGA AaGCgcGCAC CGCGCACTTC CTCCCTCCTG 5880 

CACATGCGTT GTGCGCACAC AAGTTGTTTT TTGTGGGAGA TGAAAAGCAG TCGATTTATG 5940 

CGTTCCGGGG TGCGGATGTG CGGGTATTTC GGTCTCTGGC AGGCGTACTC ACCCCGCAGg 6000 

TCAGTGGCGC GTCCCAGCAG GAGCTTCCTC TTTcCGCTGC TGCGGAGCTG CAGCCCACAC 6060 

TTCAGACGTT GCGTATCAAT TACCGAACAG AAGCGcGCTC CTTGAGCGCC TCAACATACT 6120 

GTTTTCACAT ATTTTGCGTG GGCCGTCTGA GTCTGCCGAG AACGGGTACG AGGTTGGGTT 6180 

TCAGTATATG CAGCCGGCCC GGTGTACTGC CGGTATTGAG CCGCAGTTTC GGGTGATTGG 6240 

AGTGGATCGT CACCGTTTCT CCAGACCGGA GCACGAAGCG CAgcACTCAG CGGCGCGCCC 6300 

AACTCCTCAA GCAGGGAGGA CAGGCGCGTC TGAGGACTCG GAGGATTCTC TATCGGCGCA 6360 

GGAGACAGAA GCGTGGGnGC TTGCGCGTGC TATCCGTGCC ATGGTGGACG GCGGCACCCT 6420 

GGTGCGCCAC AAGGGGGAGG CGCCGCGCGC GTGCACGTGG GCGGACGTAG TGATTCTGTT 6480 

GCGTTCTGCA GACAAGCAGG CGCGGTACGA GCGCGCGCTG CGTCTGTGGG GTATTCCGTA 6540 

CACGTCGCTT CAAACGCGGG GTATGTTTTG CGATGCGCCG CTGTCTGATC TCCTTGCCCC 6600

GCTGCGTTTA GTGCTCGAGC CTGCCGATCG GCATGTGTAC GCGCAAGTGC TCCGCGGTCC 6660

GTTTGTACGG GTCGATGACG ACACGCTTTC TCTGTTGCTG CTCCCACCCG CACCCCCCGA 6720 

CGCCCCTTTT TCGTATATCC CCGCGGAGTT AaTCcGCtGC GGCGCGGTGT GTACGTGCAG 6780

GCGCGGACTT TTTTGCGCGC GTGCAGCAGC AGGTGCGGCG CCTGGCGACG AATACCGAGC 6840 

TGCTGACCTA TCTGTGGTAC ACCGAGGCGT ATGGAACGCT GTTGCGCAAG ACCCCCTGGC 6900 

GCGTCCGTAC CATGTGATAT ACGACTATGC GTTTGAsTTG CGCGGCGGGC AGATCGGCAG 6960 

GGAAAGGGGA TAGGAGAATT TTTAGATTTT GTAGATGCGT GTCTGTCTGC CCAGGAGCGG 7020 

GTAGAGGAGC TGGAGCTGCC TTGCACGGAC CGCGCGTGTG GGGCAGTGCA GATTATGAGC 7080 

GTGCACAAAA GCAAGGGGCT TGAGTTTCCA ATTGTGTGTG TGCCGGACGC GGGGAGTTCT 7140 

GGACCGCGAG TGATGGCgCG CGTAGgcGCG GTACACTCCC CGTACGGATA CATTCCCCGA 7200

TTTTTGCCTC ACCCTGAGGG GGTGCATCCG ATCTTTGTGC AGGAACAAGA CACGCGCGCC 7260

CGGGCGTACC GCGCGGAgcT GCGACGCGTG CTCTATGTGG CTTTCACGCG GGCTGAGTGC 7320 

CaCGTGATTG TCAGCGGGGT ACTGCCTATT TCTGACGGAC ATCCTGCTCC TGCCGTTTCT 7380 

CGGTCGTTGG CGGACATCTG CTCTCTGCTC CCCTCTGGTG ACGGGAGTGA GCCTCCTTCC 7440 

TCCCTCTCTT TCTTTTCAGA GCTGCTCCCT GCGCTTATGC ATGCAGCCCC CCTTCCTCCC 7500 

CATCCTTCTC CCTCTGTGGT GCCCGCACCG GTGTCGTTTG ATGAGTGTCt GCnTGGCGCG 7560 

CCCTCAGGCG TATCAGCGCG CCCTCACCGG GTCAGCTCGC CAAAGGTGCC GCAAGCATCC 7620

GCCCCTCGGG ATTGGTACGC TGCAGTGCCT GTTCGCGCAC CCCATTATTA TCCCCgTCTT 7680

GTTCAGCCGG TGAcGTCCCT GGTTTCTCCG GCTCCGGGGC AAAACTCGGC TTCCGCTTCT 7740 

CCTTCGCCCT TGACCCCGCA GTCCCCCCGT GGCGTGGAGT TTGGCACGcA CGTGCATGAG 7800 

CTTTTGGCGC AGGTTTTCCA GTCCCCGGCG CCGaATCkTG CACTCCATAG CGTACAACGT 7860

GTTGATTcCC CTGCGGCACg CCTTGTGGCC TGCTTTCTTC ACTCCCCCTT GGGCTGCCGT 7920 

GCATGTGCAG CCCCCGCTCA CCAGCGTTTT GCGGAGTTTT CCTTTCTCAC CCGCGCTCCA 7980 

GGTAATACGA AACCCCCACA TGGAGCGGAG TACCAAGCCG GCACCATTGA TCTCCTCTTC 8040 

CTTTCCAATG GGGTGTGGCA CCTTGTGGAC TACAAAACCG ATTACGAAGA GCACCCGGCG 8100 

CGTTATCTCC CCCAGTTGCA GCACTATGCA CGGGCGGTGC AGGATCTCTT CTCGGACCAC 8160

CCGGTGACGG CCTTTCTGTA TTACCTCCGA ACCGGGCATG AATTTTCTTT GGAAGCGTTA 8220 

GAATCTCATT TTCTGAAAAA AAACGCAGTT CCGGATTCTG AATGATTGAC CCATCTGCCA 8280 

CTTCCCGGTA TGGTTCCCCA CGTTTAGTTA GTAATGGTTT TCGGCATCGG AGAAAAGTGG 8340

TGTATCAGCG GGTAGGGCAC AGGCGATTTT CTCTCATTTT CTTTTTCGTT GTGGTTCTGG 8400

GGCGGTCCCC GCGGCTGTGG GCTCAGGTTT CGTTCACCCC GGATATTGAA GGCTATGCGG 8460

AgcTGGCCTG GGGCATTGCA TCCGAAgATG GTrGCGCCgg AAaCCTCAAG CATGGATTTA 8520

AGACTACTAC TGATTTTAAG ATTGTGTTCC CCATTGTGGC AAAGAAGGAT TTCAAGTACC 8580

GCGGTGAGGG GAATGTCTAT GCGGAAATTA ATGTTAAAGC GTTGAAGTTG AGTTTAGAGT 8640

CAAATGGTGG AGCAAAGTTT GACACGAAGG GTTCTGCAAA GACGATAGAG GCAACCCTGC 8700

ACTGTTATGG GGCCTACCTG ACCATTGGGA AGAATCCTGA TTTTAAGTCA ACGTTTGCTG 8760

TTTTGTGGGA GCCGTGGACC GCGAATGGGG ATTATAAGTC TAAGGGAGAT AAGCCGGTGT 8820

ATGAGCCGGG GTTTGAGGGA GCCGGGGGAA AGTTAGGGTA TAAACAGACT GACATCGCCG 8880

GCACGgGGCT CACGTTTGAT ATTGCGTTTA AGTTTGCGTC TAACACCGAC TGGGAGGGCA 8940

AAGACAGCAA GGGCAACGTC CCAGCAGGAG TAACCCCCAG CAAGTATGGA TTGGGGGGAG 9000

ATATTTTGTT CGGCTGGGAG CGTACGCtGa AGATGGCGTG CAGGAATACA TTAAAGTGGA 9060

GCTCACCGGC AACTCCACAC TGTCTAGCGA CTATGCCCAA GCCCGAGCCC TGGCAGCCGG 9120

GGCTAAGGTG AGTATGAAGC TTTGGGGTCT GTGTGCTCTG GCTGCTACAG ACGTGGGGCA 9180

TAAGAAAAAC GGAGCGCAGG gCACCGTAgG CGCAGATGCG TTGTTGACGT TGGGGTATCg 9240

TTGGTTCTCG GCGGGAGGAT ATTTCGCATC GmAGGCCAGC AATGTATTCG GGGGAGTATT 9300

TCTCAACATG GCCATGCGAG AGCACGACTG TGCTGCCTAT ATTAAGCTCG AAACCAAGGG 9360

GTCTGATCCT GATACTTCTT TCCTTGAGGG TCTTGATTTG GGTGTTGATG TGCGTACGTA 9420

CATGCCTGTC CATTACAAAG TCCTAAAAGC CctACCCCCA GCCATTTACT TCCCGGTGTA 9480

TGGAAAAGTC TGGGGTTCGT ATCGTCATGA TATGGGTGAG TATGGTTGGG TTAAAGTGTA 9540

TGCAAACTTG TACGGCGGTA CGAACAAAAA GGCCACGCCC CCTGCTGCTC CTGCTACGAA 9600

gTGGAAGGCA GGATATTGTG GGTATTACGA GTGTGGGGTA GTGGTCAGTC CGTTAGAGAA 9660

GGTGGAGATT CGGCTGAGCT GGGAGCAAGG CAAGCTACAA GAGAACAGCA ATGTAGTGAT 9720

AGAGAAGAAC GTGACGGAGC GTTGGCAATT CGTAGGGGCA TGTCGCTTGA TTTGGTAGGG 9780

ATGTATGGTT CTTTTCTTTC CGAAgGGgCG AATTTACGCC CCTTCGgAAG GTATGCAAAA 9840

ATTCCACGTA TCGGGTCACA TATGACCCGA TACGTGGAAT TTTTGCsCCG GCGCATATCT 9900

GGCCGGGCAT GACACGCAGC GGAGGTAGGC GGGGTGTGTA GACGTTTGTG CCATCACAGG 9960

TGGCGCGTGT GGGGAAAGGT TGCTTCCCTG GGAGTGCTCC TTTTAGGAGG GCTTGTTGCC 10020

TGCACTTCAA GCGcAGcCGG GTCAACCTCC AACACGCGGC CGGGGGTGCG TATGACGATC 10080

ACCAGCCGCT ACCCCTTCGA TCGCACTATG CAGCTTTTGG aGArcGCTTT GCGCACGCAG 10140

GGCTTTAGCG TTTTTGGTAT TGTTGACTAC CGCGAGGCAG CCCACAAACA GAACTTGGAT 10200 

ATACAACCTG CAAAGCTTAT GGTGGTGGGC TTCCCTAAAA TTGGCACGCC CCTCATGCTC 10260 

GAGGATCCTT ACTTTCTTCT TCGTGTCCCT CTGTACCTTA TGGTTACCGA TGTGCGCGGG 10320 

AAGACGCGCG TGTCGTTCCA CAATACGCGT GGACTGATGG ATAGCTATGT AGAGCTTTCT 10380 

GATATGGATC AGGCCATCGA GTTAGTAGAA TCCATCGTCA AGAAAACCCT TGCAGAGTAG 10440

GACGTTTTGG AAAAGAAATT TGCGTTGCTC ATCGATGGAG ACAATATCTC CCCTAAATTC 10500 

CTTGAGGGAA TCGTCGGTGA AGTGTCTAAA GAAGGTGATA TCCACGTTCG CCGTGTCTAC 10560 

GGTGACTGGA CTACCCCTAA CATGAATGGG TGGAAGGGGC TGCTCACGAA AATTCCTATC 10620

CGACCAGTGC AGCAGTTTCG GTACGGGGAT AACGCCACTG ATAATACCAT CATCATGGAG 10680 

GCTATAGAGC TCGCGAACAA TAACCGGGCT ATCAACGCCG TGTGCATCGC TTCTACCGAT 10740 

TCTGATTATT ACAGTCTTGC GCTCAAGCTG CGGGAGTACG GTCTGTACGT GCTCGGTATT 10800

GGAAAACGAA ACGCGCGTGA GATTTGGGTT TCTGCGTGCA ACGAATTTAA GTACATCGAA 10860 

AATATTGAAA CTGAGCACTT TGGCCTGAGC GCGGGGTTTG CGTTTCATAC TGAGTCAGAT 10920 

GCTGCTGCAG TTCCTGGTGC AGGGGTCGAT GCCGTTGAAG AGGATACTGG GGGTTTTGAC 10980 

TTAGGGAAGC TCATTGCGCA CGCTTACAGA AACTCGCGCA TGACCGAAGA AGGCTGGGTG 11040 

AGCCTTTCAA ATTTAGGAAA GTCGCTGCGC ATCACAAAAC CTGAGTTCGA CCCTCGTTCT 11100 

TACAATCATA GTACCCTGcG GGAAATGGTG GAGGCTCTTC CTGAGCTTTT TGAGGTGCAG 11160 

TCTGACCGAC GTATCCCTCC CAATTATTGG GTGCGTGCAG TGCGTGGTGC CCACAAGCGC 11220 

ACGGTGCTCT ACGGTGTTAT CAAGCGTTTT CGTGAGCGTG ATCGGTGGGG TGTTATCAGT 11280

CATGAAGAGC TTGGTGATTT TCGTTTTGTG TACAGCAATC TCAAGCGTGA GTGTCGTGCT 11340 

ACTGCCTTGC CTGAAGGTAC AACGGTCAGC TTCTCTGTGT TTCGTATGCC CAACGATCAG 11400 

GGTAAAAGTG ATGAAGAGCG TCACGGGCGC GCTGCCGACG TACTCGTGGT GAAACGGGTG 11460 

GCCGGCTAGC AGACGGGaGT CTGTAAcGTG CACCGGCGTG GTGCGGAgGT GCCGCCCGAG 11520 

TGCGTTAcAC nCTnCACAGT AACCGAGGCC CGCGTAgTGT GTAGCGGGCG TATCCTCATT 11580 

TTTGTGGGCC CTTGTGTGAG AGTTTTAAAC ATGCTAGAGT GAGCCCCGCG AGGGTGcGTG 11640 

CGTGAACTTC AGTCCTAATA CGCTGGGATC CTTTGAAAGC TGCGCTGAGG TTGTGAGGTC 11700

GCTCGGGTGT GCCCTTGTCG ATCTGCAGTG GAGCGTTTCC GCTGTTTCTC GGCGTGTGCA 11760 

GCAGGCTCAG GGAAGGGCgC GTGCCGTTAT TTACAGCGCA GGGGGAGTGA CGCTCGACGT 11820 
GTGCGCGCGC GTTCATCGAA TACTGGTGCC GCGCCTTCAG GCTCTCGGTG GTGTGCGCAC
TGTTTTTCTT GAAGTCGGCT CCCCCGGGGA GCGGGTTATT CGCAACGCCG CGGAGTTTTC 
CATCTTTTTA GGGGAGACTG TGAAGGTCTG GTTTTGcACG GGGCAGTTTC AGGTTGGGAC 
TCTTGCGTTT GCGGATGAGA CTTGCCTTAC CCTGACCGCC GGCGGAGTGC CCGTTACTAT 
CCCGTATGTT CAGCTAACAA AAGCGCAGTT ACATCCTGCA GTCCGCGCTT GAAAGGGCTT 
TTGGTCTGCA CcTTCGCCCA AAATCGCCTA AGGAGCCGCC TATGTTCGGC GTCAGTAACG 
ATGACATTAG AAAGTATGCG CAGGAGAAGG GGCTTGATGA AGACTTTGCC TTTAAAATCG 
TCGAGCAAAC ACTGAAGGCC GCTTATAAGA CTACATTTAA GACAGATGAA AACGCCGTCG 
TTACCTTTGG TGAGGAGCGG GTGTGTATCT AtGCgCGCAA GCGyGtGGTT GAAGAGGTGT
ACGACCGCGT CTCGGAAGTG GATTTGTCTA CGGCACTTGA GCTTGATCCC ACTACTTCTT 
TAGATAGCGA AGTGCTGGTG GAGCTTGAGT CCGAAGATTT TAAGCGTGGA TCTGTGCAGG 
CTGCCGTCCA GCGTATCACT GAGCTGAGCA GAGAAATTCA AAAGGACGCT CTGTATGCTG
AGTACAAGAG CAAAGAAGGA GAGATTATCG TTGGCTACTA CCAACGCGCG CGAAACGAGC
ATATCTACGT TGACCTAGGA AAAGTTGAGG GCCTGATGCC AAAGTCGCAC CAGCTGCCCC 
AGGATGATTA TCGTCAAAAC GACCGCATTA AGTCGCTTGT GCGTGAGGTG CGCAAACATC 
CAAAGTCGAG CGTTGTCCAG CTCATTCTTT CACGAACTGA CTCTGCTTTT GTAAAAGAGC 
TGCTCGCCGT GGAGGTGCCG GAGATCTACG ACGGTATTGT TGAGGTGGCA AAAATAGTGC 
GGGAGCCAGG GTACCGTACA AAGATCGCCG TCACCAGTAG GCGTGATGAT GTGGATCCTG 
TTGGTGCCTG CGTAGGTCCT CGGGGCATAC GCATCCGCAT GGTTATTAAA GAATTGAATG 
ACGAGAAGAT AGATGTGCTT GAGTATTCTC CGGATCCAGT TATTTTCATC AAAAATGCGC 
TTTCTCCTGC TGAGGTGCTG AACGTCGTGG TACTTGATGA GGAGAAGCGT TCTGCACTTG 
CCATTGTTGC TGAAAgCCAG CTGTCTATCG CGATAGGAAA GCAAGGTTTG AACGTGCGTT 
TAGcGAATCG GCTTGTGGAC TGGAATATCG ATGTGAAGAC AGAGAGTCAG TTTGAAGAGA
TGGATGTGTA CACTGACACG CGTCGTGCGG CAGAAAATCT TTTTGATAAc GATTATCAAG 
AAGAGTCTGA GTTTTCyTCa TACGkGGGAT TTACgCCgGA GCTCATTAAG ATTCTGCAGG 
ACAACGGTAT CCAAGACGTA CAGACTTTGG TAGATTTGGG CGAGGAAGGC TTGCGTGCGC 
TTGAGGGCAT GGACGAGGCG CACGTACaAG AATTGCTCGC CgCCATTGAG GAGAATTTTG 
AAGTTGTCGA GGAnGGGGAG GAGGCTTCAG TTACATCTTC TCCCGGGACT GGTGGTGATG 
ArGATCAGGC gTTGCAGTGT CCTGAgTGTG GGGTGCgCaT TACTACTGAC ATGAGTGAGT 
GTCCTCACTG TGGTATTGGC CTCAGCTTTG AGTTTGAATA CGAAGAGAAC GwssmaTAGG 13620

AGAGCTATGA CCtACGAGAC AATACGCCTA AAGACACTTC CCGTGTTGCT AGTGAgCaGG 13680 

CTGTGCgTTC aCCGGTGAaC GTCTGGTCcA AACTACCTCT CGTACACGGG CgGATGTTGA 13740 

CGTAAAGGAG AAAAGACTCg TaATAAAGAA GACAATCAAA GTGCGCGCaA AGAAAGTGGT 13800 

TGCCAAAGTT ACTgrTGCGCG GCGTGTGTCG TGCGCGGATG AAAATCGCAC GCCGGGCGAC 13860

GCGAGTCAGG CGACTATTTC TGCCGCGCCC GAAGATAAAA AGCAAGGTTT CCCTGACATT 13920 

CGGGAGGATG GCGTTGCGCG TGGTGTATCT GCCTCGTGTG GCGCTGTGCA GAACGCTGCG 13980 

TCTGCACAGG TTCCCGGTGC CCGTACTCCG GGGGTTATAG GCGTTCCTGT TGCCAGCAAA 14040 

ACGGTGGAGG AAGCAAGGGG TGGGGGAGCT AAGCGGGTAA TCACTAAGCG TGTGGGTGGG 14100 

GTTTTCGTGC TTGATGACTC TGCGGCACCC CTAACCGAAA GGCAGGAAAC CTTGCATCTG 14160 

GCGCGCGCCT TTCTCGGTTT AGCCGCAGTG ATCGTCAGCG CACAtyGGGT TTTCTGGTAC 14220 

TCAAGCGCGT GCTAACgCAG GTGGTGTGCG GCGTGGAGAG GGCCGTCCGT TTGCTCGCGA 14280 

TTTCAGTCGT GGGTCCACGG GTGGGTATCG GCCCGCAGTG AGAGGTCCGG CTCGGCCGGC 14340 

TGGACGTGTT GGTTCGGGTC CAAGAGGGCC GGCGCCCCTG CAAGTAGGTG CTGGTAAGCC 14400 

TGCCCAGAAC AAAAGGTCTT TCCGGGGCAG AAAGCAGCAG ACATATCAGT ATCAGCATAA 14460 

GGATCGTCTT GAACTGGAAG AAAAGCTTCT CCAGCAGAAG AAGAAAAATA AGGAAAAGCT 14520 

TGCGGCGGTC CCGCGCTCTG TTGAGATCAT GGAGTCCGTT TCGGTTGCAG ATCTCGCAAA 14580 

GAAGATGAAT TTAAAAGCCT CAGAGCTTAT CGGTAAGCTT TTTGGCATGG GCATGATGGT 14640 

TACCATGAAT CAGTCTATCG ATGCGGACAC CGCCACGATT CTTGCTTCTG AGTACGGGTG 14700 

TGAGGTAAGG ATTGTCAGTC TTTACGATGA AACAATTATC GAAAGTGTAG GTGACGAGCA 14760 

TGCGGTGCTC CGCGCACGTC CGCCAGTAGT GACTGTTATG GGACATGTTG ATCACGGAAA 14820

AACTAAAACG CTCGATGCCA TCAGAAGTAC GCGCGTTGCT GAGGGGGAGT TTGGCGGTAT 14880 

CACGCAGCAT ATTGGTGCTT ATGCAGTCTC TACTCCGAAA GGCTCAATTA CCTTTTTGGA 14940 

CACGCCAGGT CACGAAGCTT TTACCATGAT GCGCGCGCGT GGAGCAGAAA TTACCGATAT 15000 

TGTGGTGCTC ATCGTAGCTG cAGACGATGG GGTAATGCCC CAGACGATCG AAGCGATCAA 15060 

TCACGCAAAG GCTTCGAAGG TTCCCATTAT TGTTGCAATC AACAAGATTG ACCGTGCGGA 15120 

TGCGAACCCG AATAAGGTCA TGACGCGCCT TGCTGAGCTT GGCTTAGCTC CAGAGGAGTG 15180 

GGGTGGTGAT ACCATGTACG TGAGTATTTC TGCGCTGCAA GGTATTGGGT TAGATCTGTT 15240 

GCTAGATGCC ATCATGCTGC AGGCGGAGGT GATGGAGCTT CGTGCAAATT ACGGGTGTTG 15300 

TGCAGAAGGG CGCATTATAG AGTCTAGGAT TGATCACGGG CGGGGGATTG TCGCGAGCGT 15360

TATCGTGCGT CGTGGGGTGC TTCGTGTTGG TGACACGTAC GTTGCaGGTG TGTACTCAGG 15420

GCGTGTGCGG GCAATTTTTA ATGATCAAGG GGAGAAGATT CAGGAGGCGA CTCCTAGTAT 15480

GCCCGTTGAA ATTTTAGGGC TTGAGGGAAT GCCCAATGCG GGTGATCCTT TTCAGGTTAC 15540

GGATTCTGAG CGTATTGCAC GGCAAATTTC GCTTAAGCGT CAGGAGTTGA GGCGTTACGA 15600

AAATGCGCGC AACGTGAAAA GGATAACGCT TGACAAGCTG TACGAGTCTA TCGAGAAGGG 15660

TTCGGTTTCG GAGTTCAAGG TTATTATTAA GGGGGACGTG CAAGGATCGG TTGAAGCGCT 15720

CAAGCAATCG CTTGAAAAAC TTTCTACCGA TGAGGTGCAG TTGCGTGTCA TTCATTCGTC 15780

GGTTGGTGCG ATAAATGATT CTGATGTTAT GCTCGCAGCT GCTGATTCAA ATGTGACCAT 15840

TGTTGGTTTT AATGTACGTC CCACTCCCCA GGCTGCGGTT CTTGCAGAAA GGGAAAGAGT 15900

AGAAATCAAA AAGTATACTG TCATCTACCA GGCGGTGGAG GAGATGGAGC GAGCTATGGA 15960

GGGTATGCTC AAACCATCCC TCAAAGAGGT AGTGCTCGGT TCGGCGGAGG TGCGCAAGGT 16020

GTTCAAGATT CCCAAAGTGG GAAGCGTTGC AGGAGTATAT GTGCTTGAAG GGGTAATGAA 16080

GAGGAACGCC ATTGTTCACG TTGTGCGCGA TGGGATTGTC CTGCATTCGG GGAAGGTTTC 16140

CTCATTGCGG AGAGAAAAGG ATGATGTGAA AGAGGTACAC AGCGGCTTTG AGTGTGGGGT 16200

TGGAGTTGAA AATTATTTTG ATTTTAGGGA GCGTGATCGG CTTGAATGCG CGGAGATGAA 16260

GGAGGTGTCG AGGAAACTGA AGGATGCCGC TCTTTCCGAT GCGGCGCGCT TACAGGGATG 16320

AAAcAGGTAA GTCAGTTAAG GGTGCGCAAA TTGGGGGAGC ATATCCGCGC AGAAATAGCG 16380

CAGCTTATTA TGCTCGGCAA AATAAAGGAT CCACGTGTTT CTCCCTTTCT CTCTGTGAAT 16440

TGGGTGGATG TGTCTGGGGG GATGGTCTGT GCGCGGGTAT ATGTGTCGAG TTTTATGGGT 16500

AAGTACAAAA CGAAgCAGGG AGTGCAAGGC TTAGAAAGCG CGGCAGGTTT TATTCGCTCT 16560

GTCTTGGCTA AGAAACTCCG TCTGCGGCAG TGTCCGCGTC TTAGCTTTGT GTATGACGAG 16620

AGTGTGAGGG ATGGATTTTC TCTTTCGAGA AAAATAGATC GGTTAGAATC CGGCGGTGTG 16680

CAGACTGAGC ATGCCtGACG CTATTGTTCC TTTCGCAAAG GTTTCCGGTC TTACGAGTTT 16740

TGCGGCACTG GCACAGGTCA            GGGAGTAAAA AAGGTAGGGC ATACGGGGAC 16800

GCTTGATCGC TTTGCTGATG GGCTGCTGTT GCTTTTGGTA GGGGGCTTTA CCAAACTCGC 16860

GCCGGTGATG ACTCGCTTGG AAAAGAGTTA CGAGGCTCGT ATCCAGTTTG GGGTACAAAC 16920

AGACACTCTA GATCCGGAGG GGGCTGTCGT GCGGTGCTCC TTGTTCCCAA CATTTGCGCG 16980

CGTGCGTGCG GCGCTGCCTC ACTTCACTGG GAGTATTGAT CAGGTGCCGC CTGAATATTC 17040

GGCGCTAAAA TTCGGAGGTG TGCGTGCGTC CGACCGGGTG CGGCGTGGGG AAGCAGTGTG 17100

CATGAAGGCT CGGCGTGTGT TCGTCTTTGA CTTGCAGGTA CTAGGTTGCG AGGCGGATCT 17160 

GGGTGAATTC AAAAAGACGC AGGCGGGGAG GGGGGCTGCG ATTGCTGATC TTGATCTGAC 17220 

GCGCGTGCGT GCTGTAACGC TGTACGTACG TTGTTCGGCA GGCTTCTACG TGCGTGCACT 17280 

TGCGCGCGAC ATAGCAGCCG CTTGCGGCTC TTGCGCGTAT nTTCACATTT ACGGAGAACA 17340 

CGCATTGGAC CCTTTGATCT TGCACAGGCG GCGGGTGT 17378



(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5641 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

GAGGAAGGCA AAACCTTTaA TTAAAGTACA CTGCGCGTAT GAGCAAAAAA TGCGCGCCTG 60 

TTTGAATATT ATTCTGCACA CAGTACCGAA GTGTrtCTGT TGGGTATGCC AGAGACACGT 120 

AACAAACAGT TGAATGAGAA GCTTGTGTAC ATCGAGCACG TACAAArGaA AGTAGTGGCG 180

CAATACGATC CGCAGCGGGT GCGCTATTAC TCCCTCAAAC CAATTGTACC CGGTGTACAC 240 

GGAACATATG CAAGCGCGAT AAGGGACACG CACGGCCGTT GGGTACACGT GATGCACAAA 300 

GACGGCATCC ACTACACCAT AGAGGGTGGT GCGTACGTTA TGGAAACTCT CTTACCCCTT 360

ATTCTTGCAG ATTTGGAACG GTCTCGTCAC GGATACATGC GTTCTTCTCT GGGGTCGCAT 420 

GAACTCCCTG CGACGAAGGG aTGGAAAGAG CACGTCACGC GTCAACTCGA ACATAGGGAT 480 

AAACCGCACC GTTGTATCCT GCATGACAGG GGGTGCGTCC TGCArGGGGT TATCGTACGT 540 

CCACACCGTT GCCTGCACT? GCATAGTGCT GTTTTGGTCA CCTGAGAATG TTACCGTAAA 600 

GGGGAGTGGT GGGCGCGCCT GCGATATGAA CCGTACCACA GATCCCTGCT CCAAACGCGT 660 

AAAAGGCCTA TCTGCTTCGT ACGCAGCAGT TCCGTGTGGA GAGGTGATGC GCAGTTCGAT 720 

TGCCTGCGCG CGCAACTGAG AGCGCACCTC CAGGACGTTG ATGCGTCCGC CAAAGTAACT 780 

GcGAGAgTCC ATGGAGATCT GGAGGTGATC CTGTGCCTTT TGTATTTCCG CCGTGGACAG 840 

CCGCGCGTGT GTGCGTGCAC GTATATGCCG AGGAATGGGC AGTGGAGAGT GAAGCGAAAG 900 

GGTGAGCCCC CTCTCACTAA TGCGATAgCA TGCGGTAATT TCCTGTGGGA TACGCGTGCT 960

TGACAGTGAA aniGCTCCAGA GCACACCGTT GATAACGAGC ACAGTCAGTA ATGCACCAGA 1020 

AAGCAGAAGG TTTCTCCTAC GTATGAAGGG GATGCGAGGG TGGGATTTTt CCCGTTCAAA 1080

AAAGAGAGTG AGCATAAAAA GCATAAACGG AACACAACCG AGTGCAAAAA ATACGTTGCT 1140

TTGGTGGAAG GATAACGGCG CGGTAACCGC ACTGTTCTTT AAGTATGCGT AAAGGAAGGG 1200

AGCGAAGAGA AGAAACATCA TTGCAAGCAT TATCCGTCGT GTAGTCTTTC TCTGTGCGCT 1260

AAAGAAGACA AGCGAGAGCG CATACTCTAG AGCGAAGACT GCTGAGAGTG AAAAATCGAT 1320

AACAGAAAAA AGGACCGCGT ACAACAGACA GAGTGTGCTT GCAAGATATC CTCCGATAAA 1380

TCCGTTGTGC AACATGGAAT CTCTGATTGT CTTGCTGTTT GCCATGCACA CGCACGAGAG 1440

TGCAATTGCG ATGCTGTGCT TTACTATGAG TGCAAGAAGC GGTAATGCAC CTGTTGATGT 1500

GTGCGTGCCA AAGCGGATAA GAAAAAAAAG AGCGGTAAgT TGTGTTGCAA CGAACACGCT 1560

ACACACGCTC AGTACCCCTA AGATAGCAGG GAGCCACCAC ATGGCTAAAA TAGTGTTCCA 1620

GTAATGCCTT TTGCGCGAGT CAGAAAGAAA ACTAAAAATA GCCAAAGAGA GCAAAAGAGT 1680

GATGACAGCG CCTAAGATAA GAATCACGAA AAATTGCTCT CGAATGAGGT AGAGCGTGCC 1740

GCCGTACGAA ACACTCACGT AATGCGTGTC CCATTCTTCT GAGTACACCT GGGTAAGGAG 1800

TGCCgGGAGC GTGTGCAGTG CACCTGGAGG GTACAACGAA TTTTCCATCC TTACCGCaGg 1860

AATATGCTCC TTGATGTAGA GTGCATGGCG TGGGTCCTCG TGCAACCATC CAAGCCGGTG 1920

CAATATGGCG TCAAAGTCCC GATAACGGAT GGGGACGTGG TGTGATGTTA GGTGTTGGTA 1980

TACTGCTTGA AGAAGCCAGG GTGGGCACAC TGTACGGTGT GCCCCCGTGT GCAAGCGCGG 2040

TGGTCCTGTA GTTTCGTCGA GTATCAGGAC TATTGGCGCC TTATAGGAGG AGATGAGCGA 2100

GATGAGTTTT TTAGTTCCTG TAAGCCGATC GACGGGCACA AAATCAGGAA CGGGAGGATG 2160

GTCGTGTGCA GTTATTGCCA CGAGTACCGA CACGTCTGGG GTCTGGTGCT CAAAGTCCTG 2220

CACCAGCAAG CGGAGCTGCT GTACCGCACG CGTCTCGCGT TCTTGCGCTT CACGCGATGC 2280

AGTGAAGACG ACGAGCACAT CCGTTCTTGG TCCAAAGAAG TGAAGCTCGT TTTCCTGTGC 2340

GTGTACAAAA ATACCACAGA GCACGAAAAG CAGCGCGCGG CACACCCGAG TCATGACTTG 2400

TTATCGGTAG GGAGTGCGCC GGTTTCGAAC CATGTCTCTA TGCGCTTGTA GGAGCTATTA 2460

ATCcTGCGCA CTATATCGGC TGCCTGTTCG GACGTGTGCG TACACGTGAA GACGTCAGGA 2520

TGGTATTTTT TCacAGTCGT TTCCACGACT GCTTGcAGTA GGAGAGGGGG AGCCCTGCAG 2580

GTACTGAAAG GACTGCAAAA TCCTCAACGA GTTCAGGAGG AACAGGGACG CGCACTGGTC 2640

CTGGGGGAGG GTTCTTTTTT GGCGGCGGTC GGCGCTCCAT GCGTCCTGCA CAGGTGCGGT 2700

ACTTGCCGCC GCGGTTGTCC CATTGATCGA AGGGGTCTTC GTCGGAGTTG AGCCGGTCGC 2760

GTAGGATAGC ACCGATTCGC TCGTAAAAGC TGGTCATAGA CATGTTCCTT TTTGAATAAT 2820

GTCGCCGAAG CGTGCGCCTT CTCtGCGTGT ACGGAGCGTT GGAGTACTGC CCCCACAAAC 2880 

AGGTTGGCGT GTTCGGGAGC TTCCCCGGCA GAAAGATTTG CCckCTGCGC ACTATTACCA 2940 

GGTTCTCTTT TTGGAGCACA GTCCCACGGG GGTATGCGTG GAGATAGTGA AGAGAGCGGT 3000 

TAGATTTCTG GTAGTGAGCG CGTTCTGAGG GAGCAAGCAC CTTCTCTCCT GAACCGATGA 3060 

CGGCGCGAAC AACGTGCGGG GCATACCCAC GCTCGTGTAG AAAGGAGATA ATCTGTGAGG 3120 

GAGAACGGCG AGCGCATGAG TTGAGCsCCG CTGTCATGGT GCGAAAGTCG GCAGGATCTA 3180 

ATGCAATGGA GTCATCGAGT CCTGCATCTG TTCGCGAGAG GCAAATGTGT TTTTCGACGA 3240 

TGCAGGCGCC GTGTGCACGG GCAAGGAgCG GGACAAGGAG CGGGTCTACG CTGTGGTCGC 3300 

TGACGCCGAC GTTGATATTG AAGATGGTAG CAAGCGCAGG CAGCAGCGCA AGGTTGTACT 3360 

CTGTCTCTGG AGCAGGGTAT GCGGTGATGC AGTGCAGTAA GGCGTGGGAG CTGCCCTGCT 3420 

TGGTATACTG GCGGCATTGG GCAAGGGCCC CTTCGATTTC CTTCAGGAGG CAGACTCCAC 3480 

TTGAAAGTAT AAGTGGAAGT TCTGCAGCAG CGAGTGTGGA GATAAGGGTG GGGTAGTTGA 3540 

GCTCTGGGGA AGCTACCTTG AGGAAGTCTG GTTTcAAGGC GAGCGCCTCT GTTGCAGAGC 3600 

GCGGGCCAAA GGGGCTGATG CCGACTAGCA TACCCCTGCT TCGTGCGTGG TTAAAGCACT 3660 

GCGCATAAAA GGAAAGTGGA ACTTCTAACT CCTCAAAGCG CTGGTAGAGG GAAACTGCTC 3720 

CGCTGGGAAG ACGGACAGCC CCCGTCAGCG GGTGCAGTAT TTCGTGCGCG TAGATGAGCT 3780 

GGAATTTGAC CsCAGCTGCT GCTGcgTCTG CAGCTGCGTC TATGAGCGCC CGCGCGCGGt 3840 

cAAACGAGCC CGCGTGTGCG aGCCGATTTC AGCGATGGTG AGTATATCCG CGTCTGGGCG 3900 

AAAACAACGT CCCCCGCACG TGAACATGGG GCATTGTACG CCAAACGCGT GATTGGTGTA 3960 

TAGCTTTCCT GATCGGTAGG CAATCCTTGC CGTGGTTTGT ATGGGTAAGA GGCAGGTGCT 4020 

AAGATAGTGT GCGCTTGTCA GACATCTATT TTTGCAGTAC CGTCGTGTCG GCCCTGCGGG 4080 

TGCCGAGGAT GAACGGCATG TTGCGCACGA GCGTGTTGGT ATGTATTGGG TGTCTCTCTG 4140 

CTGCAATCCC TGCGCGCTTA nGTGCCCGTG CGGTGCCGCC TCTCTCTAGT GCGGTGGTAG 4200 

ATGAGGCGGC ACTCCTTTcT GTGCArGAGG CGCGTGGTAT TCGCGCCCTT CTAgAgGGcT 4260

TGCGCGCCGT TCTGGArATG GCTCTTCCAG ATCGCATCCT TCTCCTGCGC CTGAAGCTCA 4320 

TGCGCGTACG CTCCATGACG GTACGCCGTT GCAGATAGCG GTTTTGATTG TTGATTCGCT 4380 

CCAGGGGGAT AGTCTTGAGG ATTTTTCATT GCGTGTGGCT CAGGAGTGGG GTATCGGCAG 4440 

TCGTGCGCAG GATACAGGAA TTGTGTTAGT GATTGCGCGC GCGGAnTAnA AGCACGCATC 4500 
GAAGTAGGAT ACGGTCTTGA AGACCGCGTC ACCGACGTGC ATGCACATCA GCTTATCCGT 4560

GGGACgCTCG CGCCGTGTTT TCAAGCTGGC GCCTATGCAC AGGGTGTGTA CGAAACGGTG 4620 

TTGCGTTTGG CTACCCTGGT GCGGGGTCAA CACGAGGTAC AGCAGTTCAT GCAGCCGCGC 4680 

TCTGTGCAAC CTGCGGTACC GCGCCGGGGT CCAGTGAGAA ATAGTGCCGG GAGCGTGTTT 4740 

TTCTTCCTGC TGCTTTTTTA CTGTCTGGGG GGCCGGCTTT TGCCAGGGGG AGTGTTGTGG 4800 

CCATTGCTGT TCTTCGGCAC TCGGCGGCGT TATGACCCGT TCGGGTCAGG GTTTAGCGGC 4860 

GCATTCGGGG AGTGGGCAGG GGATGGAGGA GGGTTTTCTG GCGGTGGTGG TCGCTTCGGT 4920 

GGAGGCGGGG CCTCTGGTTC TTGGTAGCTG CTCCTAGCAC AGCACGGTTT CTTTTTCTGT 4980 

ACGGGCAGTC TCTCTTGGAA GAGGTGTATC TATAGTGTGC TCGGTGACGC ACGGGAAAAG 5040 

CATAAGGAGT GAGAACAATG ACTGAAGAAG CTATGCGCGC GATGGCACTT TCCATCCGCA 5100 

GTTTGACGAT AGACGCCATC GAACGGGCGA ATTCTGGTCA CCCTGGTTTG CCGCTGGGCG 5160 

CAgCAGAGCT TGCTGCCTGT TTATATGGGA CGATCTTAAA GCATAATCCG GCGAATCCTA 5220 

GCTGGTTTAA TCGGGATCGT TTCGTCCTGT CTGCAGGACA CGGGTCTATG CTCTTGTAaT 5280 

GcTGCGCTCC ACCTTTCTGG GTACGACGTT TCGCTTGAGG ATATTAAGAA CTTTAGGCAG 5340 

GTAGGCTCCC GGTGTCCTGG CCATCCTGAA TACGGTTGTA mCCCCGGTGT GGAAGCAACA 5400 

ACCGGTCCAT TGGGTCAGGG TAcTCTATGG CGGTGGGTTT tGCGCTTGCA GAGGCAATGC 5460 

TTGCGGCAmG TTTTAATACt GATGAgCAtG CCGTTGTAGA TCACCACACC TATGCGCTTG 5520 

TGGGGGAAGG CTGCCTTATG GAGGGCGTTG CCTCAGAGGC TTCTAGCTTT GCCGGCACTA 5580

TGCGTCTGGG CAAGCTCATC GTTTTTTATG ATGAGAACCA CATCAGCATA GACGGATCTA 5640 

C 5641 
(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8790 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

GGCAACAGAA AGCGGCGTAT GTCCGTCAGC GTCGCGTTCC TGGGTCGGGG GACCGAGCGT 60

GACAAGGTGT TTGATCAGAT CGCGGTCCAG CGCACGCACG GCCCAATGCC AGGGTGTTTT 120 

GCCGCTTTTG TCCTTTTGTA GAAGTGTCTT TCTGGTTACC AATTCGTGCG CAGTTCCcTT 180 
ACGGATGGCG CGGGTAAGCG GAGTGTCTCC CTGTGCATCC CGCGAAAAGA GGCTCGCGTC 240

TCGTGCAATC AGCATACGAA CAGACTCAAA ACAATTTTCT GTGACCGCCA CCATAAGCGG 300 

CGTACTGCCT GAAGCATCCT GGGCCTCAGT ATCGGCACCC ATAGAGAGGA GGAAGTCAAC 360 

CACGTGCGCG TCGTTACGCA ACACCCCCAC GTGCAAAAGT GTATCTCCAT TTGCATCACG 420 

GACGTTGACA GAATCTTTAC CAAAACGGGT CTTCAGCGTA TCTAGATCAC CGCGCGCAAC 480 

CATTTCAAAG AGATCGACCG AAGCCTGAGG AGAGGGAGAA GACGTAnTAG TGCAGGAAAG 540 

CAAGACGAGG AAACACGCAA ATGTGCTCCC CACAAACCAC ACAATGCCAC GATTATGTAT 600 

ATGCATGCAG CGGATCCTCC TGAGTATGGT GCCGCGTCTG TACAGTGTGT GTAAAAGCGT 660 

ATCCTACCGG TTTCGGCGAT AAGGCACAGA ATCTTTAGAC GCCCACTCTC CCGTGAGGAC 720

GCAACcGCGC AGGCGCGTTC CCATTTTTAA AAACCCAGTA TCTGCTGACG GTGATGTAAT 780 

CCGAGGTCTT TTAATGTGTC ACACACCTGC TGCTGAGGTG AcCGTGCACG TGCCTTTGCA 840 

AAATCACGCG CAAAACAATT CATTGAAAAA TGTTGAAATA CAAAAAGCGG CGTACGGCGG 900 

TCACACACCG TTTGTAGTAA ACGCGCATAG GGTTCAAACA GATGTCCGCT TTTCAATTCG 960 

CTGTAACCCA CAATCCACAG ATCACATCCT AACGATCGCT GCAGCTGACA CGTACGCCGC 1020 

GCCATCCAAC ACTGGCTACG ATGCAAAAAT GACTGCACCT GCCGTGCACG CgCAGTTCCT 1080 

GCTGGATCTT GCGCAAGGaG GCGTACCAGT GCACGGAGCT GAACCGTTTT TGCCGTGTGA 1140 

AGCGGCGTTT TGTTCAACAG CATCACGTCT GTACGAAAAT CAACCCCTAA CTGTGcATGC 1200 

CGCGAAAAAA AGCCCTGaGC CACTCGACCT GACTGTCCTA CCAGATACCG TTGCTGCGTA 1260 

TGCacTGGCT CCTCTTTCCC TGGATTGTCC GAAACCAAAA TAAGGGCCGG TACAGGATCA 1320 

CTTTGGGTCA GTTCCTCAAG CGCGTGAGGG TACACAATCG GAGTGTGTAC CGGATACGTG 1380 

GGTATTCCTT GTGCACGCAC GAGTGCCGCT TGCGCACGGT GCAAAAAAGC CTGCGGCGTG 1440 

CAACACGCcA CTGCAGACTG CGCTGCATGC GGGTCCATAT ACGGATACCC AAACGCGCGC 1500 

AAACTCTGTA CACAAAAAGC CTTGAGGTCA GTGCGAAAAG CGGcGAGCGC ATGCCACTGC 1560 

GACCGAGTcA CGCGCTcACA TcCAAAACAG AAAGCATCCT CTACTATACC CTACATACCA 1620 

CGTCCCTTCC TACAGACTGc AGTGACGGCG CAGGCGCACT GGCTCAGTGC TTCCTCCAAA 1680 

ACGGCGCCCA TTGACAAACC ACCCATAAGG TCTCACGATT GGGCCTCTGT GTAGAAGAGA 1740 

ATATCACCAT GCTGCAAAAA CGCTCAGATA CCCTCGACCG TCTGCGTCAC AGTCTGGCGC 1800 

ACGTTATGGC AGAGGCCGTT CAAGCTCTCT TCCCCGGCAC CAAGCTCGCG GTGGGGCCGC 1860 

CTATCGATTA CGGGTTTTAC TATGACTTCT CACCTCCCCG TCCCCTGTGC GATGCAGACC 1920 
TAGCCCCCAT TGAAGAGAAA ATGCGCGCCA TCTTGCGTGC GGGGTGTCCC TTTGTCAAAG 1980

AGGTGGTTTC GCGTCCTGAC GCGCTTGCTC GTTTTAAAGA CGAGCCATTC AAGCAAGAGC 2040 

TCATCGAACG CATCAGCGCA GACGACACGC TCAGTCTCTA CCACTCCGGC GCGTTCACTG 2100 

ACCTGTGCCG GGGTCCTCAC GTGCAGTCTA TGCGAGACAT TAATCCGCAC GCCTTTAAAC 2160 

TCACGAGCAT CGCTGGGGCC TATTGGCGCG GTAATGAGCG CGGCCCCCAG CTGACGCGCA 2220 

TCTACGGCAC TGCCTGGGAA TCTGAAGAAG ATTTGCACAC ATACCTTCGC ATGCAGGATG 2280 

AAGCAAAACG CCGAGATCAC CGTAAGCTCG GTCCTGCACT CGGTCTCTTT CACTTGGACG 2340 

AAGAAAATCC TGGCCAGGTC TTTTGGCACC CTGAGGGGTG GACCCTCTAC GTGGCCATCC 2400 

AGCAGTACTT GCGCCGCGTC ATGCACGAAG ACGGGTACGC AGAGGTGCAT ACTCCCTTTG 2460 

TCATGCCCCA AAGCCTTTGG GAACGCTCGG GGCACTGGGA CAAATACCGC GCCAACATGT 2520 

ACCTGACCGA AGcGAGAAGC GTTCTTTTGC GCTCAAGCCC ATGAATTGTC CCGGACATGT 2580 

CGAAATCTTC AAGCAAAAAA CACGCAtTAC CGTGATCTCC CGCTCCGTCT TTCGGAGTTT 2640

GGCTCGTGCA CCCGCAATGA ACCGTCAGGC TCCCTGCATG GAGTTATGCG CGTACGTGGC 2700 

TTTGTACAAG ACGATGCCCA TATCTTTTGT ACTGAGGCGC AAATCGCATC GGAGGTCACC 2760 

CGTTTCTGTC GCCTCCTTGC GCGGGTATAT GCTGACTTTG GCTTTGCACA GGAGCAGATC 2820 

CGCGTCAAGT TTTCTACGCG CCCAGAGCAG CGCATCGGAG ACGACGCCAC CTGGGACCGG 2880 

GCCGAACGCG CATTGGCAGA AGCATGTGAA GCAGCAGGCC TTTCGTACGA GCACGCACCG 2940 

GGAGAAGGAG CGTTCTATGG ACCAAAGTTG GAGTTTGCAC TTATAGATAC ACTCGAACGC 3000

GAGTGGCAGT GCGGCACCAT TCAGGTAGAC TATCAGTTGC CCTCGTGCGA GCGCTTGAAC 3060 

GCAGAGTATG TGGGGGAGGA CAACCAACGG CACATGCCAG TGATACTCCA CCGCACGGTG 3120 

ATTGGGTCTC TAGAACGGTT CATCGGTATT CTCATTGAAC ACTACGGGGG TGCATTCCCC 3180

CCATGGCTCG CACCGGTGCA GGCAGTGGTG ATTCCGGTTG CCCCTGCCTT CCTCGAATAT 3240 

GCGCAgcACG TTGCACGGGA GCTGTGCGCC CGTTCGCTCC GCGTGCAGGC AGACGTGAGC 3300 

GCAGAGCGCA TGAACGCAAA GATCCGCACT GCCCAAACGC AGAAAGTGCC CTATCTGCTC 3360 

ATAGTTGGCG AGCGGGAgTG CGCGCGCAcA GGtAGCGGTG CGTCCGCGCA CAGGGCCCCA 3420 

GCACTCAATG GGGCTCTCAG CCTTTTCCAC CTTTTTGCTC GCGAAcTAGA GACGCGCGCG 3480 

CTGCACGCCT AGCCCATGAG TCCCCTGTGC CTTTTCCCCA AACCTTCAGG GGAAGGGACG 3540 

CTATATCCGT AGCTGCTGTA CGCTACCGCC GTAGAGtGCG CGCGCGTGGC GTTGATATCC 3600

TCACTCTTTA CATAAGAaTC AAAGTCCATC ATACGATCGA TAATCCCGCG CGGCGTAATT 3660
TCCACAATGC GGTTTGCAAC AGAGCTGACA AACTCATGGT CATGCGAATT AAATAAAATC
ACGCCGGGAA ACTGCACCAA CGCCTCATTC AGACTTGCAA TTGCTTCTAG GTCCAAATGA 
TTGGTCGGCT CGTCCAATAT CAAAACATTG CTCCCAGAAA GCATTAATTT ACTAAGCATG 
CAGCGTACTT TTTCCCCTCC AGAAAGTACA CGCACAGATT TGAGCGAATC CTCGCCTGTA 
AAAAGCATCC TGCCTAAAAA ACCGCGTACG TAGGTTTCAT CTTGATCATC AGAGAATTGG 
CGCAACCAAT CCGTGATAGA AAGATCACAA TCAAAATACC GCGCCGTATC CTTTCCCATA 
TACCCAACAG ATACCGTCTG TCCCCAACGG AAAGAGCCgG CATGTGCCTG CTTTTCTCCA 
GCAAGAATAT CAAACAATAT GGTCTTCGCG CGGTGTTCTT^^^^GACGAA AGCGATTTTG 
TCTGTGCGCC CAACTGTAAA GCTCATGTCT GTAAAAAGCT CACATGAACC TCCCTGCATT 
CGGTCCTCAG CGGCATAGCG CAGTCCATCG CACGACAATA CGTGATTCCC AATTTCACGC 
CGTGGTTTAA AATGCACATA GGGAAACTTT CGACCAGTCA CCTCAATCTC TTCCAGCACC 
AATTTGTCAT ATATCTTTTT ACGACTCGtC gccTGCCGGC TTTTGGCTGC GTTAGAAGCG 
AAgCGCAAAA TAAACTCCCT CAGGTCCTTC ATCTTTTCTT CACGCTTCTT CTGCTGATCC 
TTAACCTGCC GCTGCATAAT CTGACTCATC TGATACCAAA AATCGTAATT GCCCGAGTAC
AAACGAATCT TCCCATAATC GATATCGCAA ATATGCGTAC ACACGCTATT TAAAAAATGC 
CTATCATGCG AAACTACAAT CACAGTGTTG GGGAATTCAA TGAGAAATTC TTCCAACCAC 
GCAATAGAGT ACAAATCCAA ACCGTTTGTC GGCTCATCGA GCAAAAGCAC ATCGGGATTA 
CCAAACAACG CCTGCGCTAG GAGTACACGT ACCTTCTGGC TTTCGTCCAA TTCGCACATC 
ATCCGATCAT GGTGTGCCTC ATCTACACCC AACCCAGAAA GCATTTGTTC AATGCAATTT 
TCTGCCTCCC AGCCATTCAA ATCCGAAAAC TCACCTTCCA ATTCTGAAGC CTTCAACCCA 
TCTGCTTCAC TAAAATCACT CTTTGCGTAA AGAGCTTCCC GCTCCTTCAT CACTCGATAG 
AGCGCAGGAT GCCCCATGCA TACGGTATCT TTCACCGTGT GCTGATCGAA GGAAAAATGA 
TCTTGACGCA GAACTGCGAC GCGCGCGCCG GATGCGATAg CGATACTTCC CTGATGATGT 
TCGAGTTCAC CGGAAAGGAC TTTTAAAAAA GTTGACTTAC CTGCCCCGTT CGCTCCAATG 
ACTCCATAGC AATTCCCTGC AACAAACTTT AAATCAACAC CTTTAAAAAG AGGTTTGTCA 
GAAAACTGCA CACTCATACC CGTCACTGTT ATCATGCGGC GCATGcTAGC GCAAAATCCG
TGcACAGGaC AAGCCGCTGT CCATAGAGCA TCACACATAC AGCGATGCTA TGAGCGCGTC 
ACTGTGGAAA ATATACGTGC AATACACCTC GTTCATTTCT TACACACAAC TGTGcAGAGC 
CCCCTGTAGA AAGACAGGTC CCCAGTGTTT TCCTCACACG CTGATCATTT ATGTACACCG 
CACCGTGGCC AGAAAATACT GAAAGTGCAT AGTACGACTG CCTTTCTGTA AAACGCGCAA
CAACTGTGCC GGTGCGAGTA CCTATCTCAC TATTCCCTTG cAACGTACCA TCAAAAGATA 
GTGTGCCACG GTCCGTGTAT AAATGCGCGT CAGATAGGAC GCAGCGACTG CATTGCGTAT 
CACCGTCTGT CGTGTGCAGG AGAGTCCGAT CGGTACGTAC TCCATTGAGC TTTAGTTCGC 
TCGCGTGCGC ATACACATCT GcAAAACGCA CCTCAATACC TTCAAGCCGA AGACTGCTGT
CTTTCACCCG CACTTTAAGA TTGTGCACGT TGTGctgCGC TGgCACACAA ATGATCGCTT 
CAATCGGTAC CACGCTATGT CCCCATCGGC TCCCCCATAC GTTTTTCCAA AATGTGTAAA 
ACCAATCGCG CAACGTATCG CGTACGTTCA ACGCACTCTG CATTACTCCT GGGGAATCCG 
CTGGAGGCGT GGACTCCCGT ACTGTTTTCT TTCCACGCCG TAGGACAaGC ATCGTAGGAT 
CCAGGTGAAT CGATAGCGGA TCGTACACGC TGTTCTTTAC AACCTTGTAT GCAAGAGAGC 
GCCgnTGCGC ACATACCTGT ACACTCACTC GTACGCGAtA GCATCTATAA CAATGGTATG 
GGGATGTGCT TGATCTAGGC AGGTATATCC ATCTGCATCG GTAAAGGTAC GCACAACAGG 
TTGAGAACTG TGCGCGGGGT GACGCTGCAT ACGGCTACTG GTCCAAAACC TCGATACATC 
TCCACGCAGA TACTTCATCG ATCCGCCCAA AAGATACGCA CACGCTAGAG AGAGCGCACC
AACCAAACAA ACGATAACAG TGTGTGCACG CGCTTTTTTG TCCATTTTCT CCCCCTCACC
TATTTCTCCT CTGTAGAGCC TTTCCTCCGT CCTTAAACTG AACACCAGTT AGTGGACCAG
ATTACGCCGC ATCAGTACAA TCGCGCGCAA TGAGTGGGGA ATATCAATCT TTCACGCTCA 
AGCGTGCGCG ACGcGTCTAT GACCAGTATA ATGTGATTAA CTCCCTTTCG TTCGCACTCG 
TAACTGGCAA TACCATTACG CTCTATGCAC TGCTGCTTGG TGCCCGCAGT ACCACGGTAG
GCTTGCTAAG CGCGTGCATG CACTTTTCCT TCTTTGCACT CCCTTTAGGA AAACTTGTGT 
GCCGACGTTT TGGCGTCATT AAAACCTTTG CGTACACCTG GATCGCCCGC AATACTAGTT 
TGCTTCCAAT GCTCGCAATC CCTCACCTTT ATGCACAAGA CTATACGGCA CTTGCACTGT 
ATGTGCTTAT TTTTTCCGTC GCACTGTTTA ACTTTTTTCG TGGTATGGGA ATGATCGCGA 
ACAATCCGGT CATCACCATG CTCGCACCAG GCAAACATCG CAGCTCATAC ATCGTACGCA 
TCTCGCTTGC GAACAACAGT GCCATACTCA TTGCCACGCT TTTACTCTCC GGGGcACTGA 
GCGTTAACGC TTCACTCACA ACCTATCACT TTGCAACTGC ACTCGGCATC GCACTAGGTT 
TTTTTGCTTC GTTTCTCCTT TTCACATTAC CTACCGTCGA GTCATGCGAA CATGTGCAGC 
ACACTTCCCC GGAGACCCCA CGGACCTCAC CGCGCTCCGG GTACACCACG ATACTCCGTG 
CTCTGAAAGA GAAAAACTTT CGCACCTTTA CGTTCGCTTT TTTTGTCAGC AGCTTTGCCA 
CAGGTACAGT ACGCCCCTTC GTTGTCGTAT TCGCAAAGGA CGTATACCAC ACTCCAGATA 7200

GCTTTATCAC TATCCTCACC GTATGTGCAT CCGGCGGTGC ACTCATCGTC GGTTTTATAA 7260 

TGAGTTTAGC TATCGATCGC ATTGGGGCAA AGCCAATGTA CATTATCTCC TCAGTTTTAA 7320 

GTGTACTCAC CCTCATCCCT GCGCTTGGTA CGCCAGGACT CCATTCCTCT TTCCTTTCAA 7380 

TTGCTTTTTT ATGCCTGTTC TGTGCAACTA CCAGCATGGG ATTTACCGGA CAAGATAATG 7440 

CAGCGCAGTC CTATTTTTTT GTCCTCGTTC CTGAGGATGC TTTAATAGAT GTAAGTGTCC 7500 

TGTACTATCT TATTTTGGGC ATCACTGGTG GAGCCGGATC GGTGATTGGC GGCGTGGTAT 7560 

TAGACTTCTG CCATCTCTCA GGATACTCCA GTTTGCAGGC ATATCGTATC TTTTTTACAG 7620 

GAGTCAGCGC GATTATGATA ATCGGCATCG CGCTTCAGAC ACAGcTGCGC AACCTGGGTG 7680 

GATACCGTGT ATTGCGAACA CTCGCAACGC TTTGCTCTCC AAAAGATCTG CGTACTCTCA 7740 

GCCTCCTACA TAAACTCGAC TTTAACGAAA ATTTAGAAAC CGAGCAGCAT ATCGTACAAG 7800 

AACTTAGTAC CATCGCCTCT CCCATCTCTG CCGAACAACT GGGCACCTAC GTGCAATCGC 7860 

CACGTTTCAG TATCCGCGCA AGCgcATTGC AAGCACTGGA AACGATTCCC TCGCTGAGTA 7920 

CACACAACCG TAATCTTTTG CTGCGAGAAT TGCGCGAGGG AACATTCACT ACTGCCGCAC 7980 

AGGCGGCACG CATCCTTGGC ATTCATATGG TCCAGCAAGC AATTCCAATC CtGcgCGAAG 8040 

CGCTCCATAG CGAGGATTAC CTGCTCGTCG GAGAAGCGCT TGTaGcGTTA GCACGCACAC 8100 

ACGATGACGA AAGTCATTTC CTTATTGGGC ATGTGcTGGC GCGCACGCAA AATCCCTTTG 8160 

TCGTGCTGCG TGGCCTGCAA GCGCTTGAGA TGCTCAATTC AGTCCACGCG CTACCACCAC 8220 

TGTTTGAGAT TTTGCGCACA ACGTGCAAAA ATACACAAAC GCACACAGAA GCATTACTGA 8280 

CTCTATCGGT CTTGATGGGA ATACAAAATG AATTCTACTT TCTATTTGAG CGCTACgTAC 8340 

CGGTCATACA ACCGTACAAG CGCTAGTACG AGAAAAACTA GAAGAAAGTT TTGCTATCAG 8400 

CAGGGTCACT GACGCGACAC TTGAGAAAAC ACTGGAACGC TTTACGGCCG ACGCACGCGC 8460 
GGGCACCCAC GTGGTCATGT GGGTACTGGC ACGCGCAGGA GAAGACCTAG GGACAAAAAC 8520 
AGCACTCCTG CTGAGTCTTA CGTTGGAGAA TCCCCTGTGC GCGCGAGAGG CTTTTCGCCT 8580 
TCTGATAGGT ACATGGACGG CCACCTTGTT TAGAAAACCC GCACTCATGT GCTCTTAGCG 8640 
CTCAGACGGC CCGGTGCGCA CAACACGCCG CAGGACGTGA TCGACCGTGA CTATCCCCCC 8700 
TAAAACCGAA ATCGCACGGT AGAAAGCGTT TGCCCATCGC GCAACACGTC AAACCACACC 8760 
TCCCTCgTnT GACTGCAAGC ACCGCGTAAA 8790 
(2) INFORMATION FOR SEQ ID NO: 14: 
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 651 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 
nCCAnTCGCG GAAATTAACC cTCACTAAAG GgAACAAAAG CTGGAGCTCC ACCGCGGTGG 
CGGCCGCTCT AGAACTAGTG GATCCCCCGG GCTGCAGGAA TTCGATATCA AGCTTATCGA 
TACCGTCGAC CTCGAGGGGG GGCCCGGTAC CCAATTCGCC CTATAGTGAG TCGTATTACA 
ATTCACTGGC CGTCGTTTTA CAACGTCGTG ACTGGGAAAA CCCTGGCGTT ACCCAACTTA 
ATCGCCTTGC AGCACATCCC CCTTTCGCCA GCTGGCGTAA TAGCGAAGAG GCCCGCACCG 
ATCGCCCTTC CCAACAGTTG CGCAnCTGAA TGGCGAATGG CAAATTGTAA GCGTTAATAT 
TTTGTTAAAA TTCGCGTTAA ATTTTTGTTA AATCAGCTCA TTTTTTAACC AATAGGCCGA 
AATCGGCAAA ATCCCTTATA AATCAAAAGA ATAGACCGAG ATAGGGTTGA GTGTTGTTCC 
AGTTTGGAAC AAGAGTCCAC TATTAAAGAA CGTGGACTCC AACGTCAAAG GGCGAAAAAC 
CGTCTATCAG GGCGATGGCC CACTACGTGA ACCATCACCC TAATCAAGTT TTTTGGGGTC 
GAGGTGCCGT AAAGCACTAA ATCGGAACCC TAAAGGGAGC CCCCGATTTA G 
(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5338 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 
TACCCTTTCT CCTTCAGTGC GTAtCTACAG yTATCGCACC AGACGCCACT TACAGCGTTG 
GCGCGCCTTT TTGTCACGCA CGAAAcTGCG TATGTGCCTG CTATCCCCCC CACGTCTGCC 
GTGAGCCGCC CTTACACCGG TATCCTCATA GATGCGCGCG GTTCTCTTCC TGTGCACGGC 
GAATACGTGT CAGAGCCGCT GAGCGCATGT TTGTTCCCCA AGATTTGGAG CACGGACATG 
GATTTAATCT ACGAAAAGAA TATGGTTCAC CCTGACCGTG CCAAGGCATG GGGTGTGGTG 
CGGTACGGCT CGGTTTGGGA CGAGAAAATG TACCGAGACA GGATAGGTAC CACGCCCTTA 
AAAATCATTG CGCGCGGAGT GTTTGGCCAG CAGCGCACGG ATCCTATCAT TGCATCAAAG 
GATGCAGCCC AGATCTTGGC GCGCCCTGAa GAACTTGCGT TTGCTTGCAG AAGGCAACGT
GATTATCCTG TGCGACGAAG CAGCGCTGCG TGTGCACGTG CCGTATCCGC TTGTAGACGA 
GCACTTTTAC TTTGCATACC ACGACGTAAA ACGCTTCCTA ACCGACGAGC GGTCCCCCGG 
TGTCGGTGTT CGCTCTGGCA TCAATACCCT CAAGATCACC GTGTACGACG TGCGTTTTGT 
GGCAAACTCC CCAGAGATTC TCGCCTCAGA AAAAGATCGG GTAGACGTGA TAGCAACCGC 
ACTGAAAAAG ATGGGsCCGT ACACAAGkTT TTTAATTGAA GGCCACACCG CAGATTTACA 
CCGCCGTCAG GAGGAAGCGG CGCTTTCTGT AGCACGTGCG CacGCATGGC GCAGGAACTG 
TCCAGACGTG GCATTGAGAT GACGCGGATT ACTACGGCAG GACACGGTGC GACAAAGCCT 
ATCGCGCCAA GCGATaCGCA CGCGAACAAA GCCAAAAATC GTCGAGTGGA GATCACCATC 
TTGcGCGATT AGTGCACGTA CCACGGAGCA TTCTCCGTGC CGGCTATTTC TCCCAAGTAA 
AGAGAACCTG CGATGACGTA CCGATGGCTT TCTGCAGTCA GGCGCAGTTA AAAGGAAGGA
GCACTATGAT AAAGCCACGC GCGTATGCAC TGTTAGGCGT GTTTTTCCTG TACGCCTGTG 
CAAGCACACC ACGGGAAGAA GATGTACCTG AAAAATTCAC CCCCGCTGAC CTCATGCTGC 
GTGCACAGGA ATCCTACGAC GCAGGTAATA TAACGTGGGC GCGTTTTTAC TACCAAACGG 
TTCTCGATCG TTTCCCGAAC AATGAGTCAG CGGTCATTAG TGCAGAGTTT GAACTTGCGC
ACATCCTTGT TAAACAGAAA TCCTGGCAAG ATGCCTACAA TAGGCTCATG TATATACTCA 
AAAAATATGA GGCTGCAGGC AGCGCACGCC TGCCTCCTGC CTACTACAAG CTCACACTCA 
TTGATCTGTC GCGGGTAAAG CCGCACTTGA ATCTTGAGAC AGCGAATACA AAAGCAACAG 
AATATCAAAA GAACTACCAA GAAGAGCTCA AGCAACGCCA GGAACTACGG CAAAAACTCT 
TACAAGAACG CACACAAAAA ATGCTTGAGG CTCTCCATCA AGAAGAAACT CCCGAACAGG 
ACGCGCGCGA TACCGCAAAA AAGAAGACAG ACCAAGAAGA ACACACCATG CGCAAAGCAA 
ACGCGCCTAA AACCAAAGCG TCTGGAGAAG CACCCACCCC ATGAAGATCC TGCACACAGC 
GGACCTACAT CTAGGCAAAA CACTCCATGA AGTATCGCTT TTTGCGTCAC AGAAAAAAAT 
GCTCGGCGAT CTGTGCACCC TCCTTGCGCA GGACAACTAC GCCGCGCTCA TCATCGCAGG 
CGACATCTAT GACCGCTGTG TACCCTCTGC AGAGAGTGTC AGTCTTTTTA GTTCTTTTTT 
GCAAAATATC AAACGGTCCA TGCCACGGCT CCCGATATAT CTCATCCCCG GcAACCATGA 
TTCTGCGCAA CGTCTCTCCT TTGCCCAGGA GCTACTTAAG CAGCAGGGAG TATTCATTGC 
GCAGGATCCT GAAGAGAGCA CCCGTCCCCA TCTCCTCTGT CACGAGGGGG AAACAGTGCA 
GTTATTTTTA CTTCCCTTTC TCCACGCAGG TGCCTTTTCC TATCTTGaTG AGGAAAACAC 
CACTTGTCTC ATTCACACCC AATCCGAACT CCTTCAAGAA GCCTCGCGTc GCTTGCAGcG
TGCAGTATCG TTGGACACCC CTTCTATCCT TGTCGCACAC CTATTTACCC AAAAAGGTAT 
TAGCTGCGAA AGTGAACGCC CGTTTGTTGG CAATGCCGTT TACGCTGACC CACACTGGTT 
TGACTTTTTC ACCTATGTTG CACTTGGTCA TTTACACAAA TGTCAAAAAA TCACCGAACG
CATGTACTAT TCCGGATCTC CTTTGCCCTA TTCGTTTGAC GAAGCAAATA CCCAAAAGGT 
TGCGCTTTCT GTAGAGATTC ACTGCAACAC AAAGGGATTC CCCATCCATG TGACTCCCCT 
TCCACTTGAG CCACTTATCC CTCTTCGCAC CATACGCGAC TCATTCCACG CACTATATAC 
CGGTGATCGC TATCTCCTTT ATCAACGTGA TTTTTTAGAA ATCACCCTGA CCGACCCGGC 
GCTCGTGCAC AATCCTATTG GCCTTTTGAA GCCGCGCTAT CCAGGATTGC TCAGTATCAA
GCAGGAAAAT GCGTTCGCCT TTGATATACC CCCCCCCTAC TCCTCTAACG AGGGGATAGC 
GCCCTGCACA CACCACTCAT TGCGCACACA CTTTGATGTA TTTATGCACG AAGTAAGCCC 
CACTCCTGAT GACAGAGAAA AGGGCGCTCT CTTTCAGGAA CTTTTTGACG AAATGCAACA 
GGAATTCTCA TCGTGAAGCC GATGCGTCTT ACGCTCCACA ACATCGGTCC TTTCGTTGGC 
ACCCATACAG TTGACTTCAC CGCGCTCGGT CCTATTTTTC TAGTGTGTGG GAAAACAGGT
TCAGGAAAAA CCACTCTATT CGATGCGATC GCCTATGCCC TGTATGGGAA ACCCCTTGGA 
ACCCGTGCAG AAGTTATCCG CAGTCTGCGC AGTCATTACG CCGCACCATC AGAAGCTGCA 
TTTGcTACGC TGGAATTTTC ACTCGGCACT AAAATCTACC GGGTACACCG GACGCTGACT
TGCACACTTT CCCACAGAAA AACAGAGCAA CCCGAGCAGC TGTATCTTGA GCAAAAAAAA 
GGTCATGGAT GGGAGCGTAT TGCTTGTGCG CATAAAAGTG AAACTGAATG TGTTATTCAC 
GATCTTCTCA AACTCAATAG CAAAGAATTT GAGCGCGTGG TTATGCTCCC ACAGGGAGAA 
TGTGCGCAAT TTTTAAAgCA AATTCAAAAG AAAAAAAAGA AACGCTGATG AATCTATTTC 
CTGTTGATCA ATATACTGCT CTTATGGAGC GAGCAAAAAA AAAATCGCTC CATGCCAAAG 
CAGTGCTTGA AACGCTGCGT TCGCAACTTG AAACTCTATG TGCGGAGTGC ATGCCCGACA 
CATACCACGA AAGGAAACAA ACGCTAGAAG CTGAGTTACA GCACGCACGT GACGCACTGC 
AGCAAACCCG CATCTCCCAT GCGTACTATA CACAAAAACG TGAAGCGCTC GAAGCACAGC
TAAAAAAACA ACAACTTTGT AAAGAGCTGC GTGCGCGTAT AGAAACATAC CGCGCGCAAG 
AACCAGTCCA CGCGGAAACT CAAAaGCGTA TTGATCGCGC GCGAAAAGCG GCACCACTTn 
TGCGCACATA AAACACGTCA CCCAGTGCGA ACAAGATGCA CaGCGCATTC ATGCAGAAAT 
ACAGGAAAgA TGCGTTCACG CGAACAATTG CTCATGAAAC GAAGTGCGCA TGTCGCGCAG 
CAGTCATCCA TTGAAGAACA ACGCCGTCTA CTACAAACAC TTCATAGTGC GTGCATTCAC
ATTGAAGACG CGCATGACGT TGCCACGTCG ATACGCGACA TATCTTGTCA GGCGCACACA 
CTCACGCAGC ATATCCACAC GCTTGCACAA CAAAAAACAA CACTTACCCA GCAAGAACAA 
TCGTTGTGTA AAGAACTGGA TATACTGCAA AGAGAAGCGG GTACTATCGA TACTCGTACA
TCTGCCTTTA ATGATTTACA AATTCAACTC GCGCATGCAA AGAAGACACA AGAATTGTCT 
CAGCGATATG CCGAGCTCTG TGCGGtCACG CAACATGCAC TGCACAATGT GAAAAACTTG 
AGAAAATACA CGCACAAAAA AGCGCGTATA GCACACGGGC ACGTGAGCAG CTCCTTCAGA 
CAAAAGAACA AATTCATCTC CAAGAAACCC GGACACACGC GGTAGTACTC GCGCGTCTCT 
TAGAGCATCA AGAACCGTGT CCTGTCTGCG GCTCTTGCAT TCATCCGAAT CCCGCACGTC 
AAGACATAGA TAATCTTGAA CCGTTAACCC GGCGCATGCA ACGCATAGAA CAAACATACG 
CGCAGcTGGa AACCAGCGAG AAAGATGTGT ACCACATCCT CACCTCTGAG CGTGAGCGmC 
GTGCATCCTA CAGTGCACAA ATGCAGGAAA TACAGCATTC ATTTTCCATT CTTACATCGT 
GTGATACGCG ATCATCCTGC GATATTCCAA ACGTGCAAAA AATTACCGTA CGTGTTTTGG 
ATCTCACGGA AAAATTATCT CGTGCAAAAG ATATGCTCGC ATGCGCGCAA CACGCTTTAC 
TGAGAAAAAA ACAGCCTGAG CAGGATTTAC AGGATGTACG CGCACACCTG CAGCAATGCT 
CACAAGAGCT CGCAAAAAAA GAAACAGCAC TCCACGCATT GCAAGAAACG CTTACACAGC
AGCGCGTACG CATTCACGCA CTGTCCATAC GTTTACCCAA GGAATTGCTT GCATCGAACC 
TACTTGCTCC GCAAAAGATG CAGCATGAGA AGGAGAGTGT CGCCTATTGG AAAGAGATGC 
TCGCACACTG TCAAACCCTT ATGCGAGAAT TGCACACCCA TATTGAAGAA TACGACCGAG 
AGTTCAATGA GATAGAAAAC GCTTCTAGTG CGCTTGGCGC CGACATTGCA GCGCGAGAAG 
ATGCACTGAA CCATGTTCAA AAAGAATACA TGCACCTTGC ACGTACCGTG TGTTGCGCAC
GAACAGAAGC GCATTTtCAA TAACAACGAr GAAGTAACCG CCGCTCTTAT GACTGATGCT 
GAACTTTCTC ATGgCTGCAG CAGAAATTCA ATTTTTCAAT GAATTGCGTG CGGCTGACAC 
CCATCTACTG AAAACACTCG AGGGCAGAAA TAGGAACAGA AATTCCATCC GATCTTGA 
(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32768 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16:
CCGCGCAAGA TCCCAGCGTT GATATCGCTC CAACCCCTTA ATCACCACAA AGTTCAAGGG 
AGGGAATACG CTCCCGCGGT ACCCCATCCC CCGCTCATCA AACTCTGCTT CATCTGCAGA 
CAAACTCGGA ATCGGATGGT CCAGCCCAAA CGTGTGAGGA TTCACCAGAT GCTCTGCCAA 
GCGCTCTGCC TTGTCTTCAT TCGGTATCTC CCCAAGCATA GGCCAGAAGC CAGCAATAGT 
CTTATGCGGA AGCTGCTGCC CGGAAGCGTC GAGGTCGTGG TAAAAGCCAG TACTCGCGTT 
CCACATAAAA TTATTAATAC GTGTCTTTAG GGTAAAATAT ACCCGCTTAT ACTGAAAGCT 
CAGCTCCTTA TCGTTGATAA TATCGCCGAG TGCAGAAAGA TAAAAAGCGC TCACGGCCAA 
GGCAGAGTTA AAATCTACCA GATAGGCAGC TTTTTTACGT GGAGAGTTTC CCATCTCCGT 
AGCAGCAAGA GGAACCCTAT AGAGTCCGTT ACTCCTTCTA AACTGTGTTT CAATCCACTT 
CATATAGCGC ACCATCACGG GCATGATCTC TTTAATTCGT TTTTTATTTG CAGTTTTATG 
AAAGAGATTA AACTCTGCCC AGGCAAAAAG AGGCATGCCA ATACCCTCAG GATTGGCGCG 
AGGCAAAACT GGCTCTTTGC TTGCAAGATG ATACTTCCAA CGAATAGCGC CGGACTCCTC 
CTGCATTGCA TAGAAAAAAT CAAGACACTG CGTGATGTCA TAGTTCCGGT TCGAATACAC 
GAAGAAAAAG GACGCAAATA TGATTTCATG CTGACTGATA ATTAATCCGT CTTTTTCTGG 
AAACACAAAA AACGATTCGC TTGTGTTTTT CTCCCCCGAA GCAGACAGCC AATACTCCTT 
TATCCAAGCC CACGTGCGAT CATAGATGTC AACAAAATCC TGATCATAAA AATGAATCCT 
GGGAAAGTCT CGCTTATTCA CCGcATCTCC TCACACATCA CAGGCGACGG AGTGTAGCAC 
ATGCaGGGGA AAGTGAGTAT CTACCTTTCC ACCCTGTAAG CTACGCGATG TGCACACCGG 
CATCCAGACG AACTAGGATA GTAAGGTGTC AGAGGATAAG CTGGCACGTA ATAATTCACG 
CGCCGTACGT TCTTCATCAT ACGGTTGCAC GGTGTGCGCA TAAATAATTG AATGCTCATG 
CCCCTTACCC AGCAGCAGCA CAAGGTCCTG CGCACGCGCA AGAGAGAATA TGTGCCGCAG 
AGCAGCGACA CGATCCGGAA TCAGAAACAG GGTTTTACCC AATTTCTTGT GCTCACAACC 
TGCCGCAATC ATGCACAGAA TACCCATgGA TCCTCTCCTC TCGGATCCtC ATCTGTGAGC 
ACAATTACGT GCGCATAACG AGAGGCAATT GCGCCTTGCA TtGCACGCTT TgCGTGTcCC
GCTTCCCCGC CGAGCCGAAC AATACCAACA TGCGCCTCTT AACGCGnGCA CACGCGCTGC 
AAGCGGTGGC AAAaTCTCCT CGAAGGAAGA GGkTGTATGC GCATAGTCAA TGAGCACCTC 
AAAaTCCTGT CCCaTATCCA CACGCTGCAT TCTCCCCtGG ATTGGCTGGA CGTACTGCAC 
GTGCTGTGCA AAAGCCGCAA GCGACGTACC AAGCAATCCA TGCAGTACAA GAAAAGACGC 
TGCTATATTA CAGGCATTGA AAGCTCCTTC AAGCGGCACC GATACATCAT GTGCTCCGTC
CTGTGCTGGT TGTGCAGGTT CCTGAACAkT TGACAACACA AACCTTAATC GCAAGGCCTG 
AGATATCTGA GGAAGTGTTT GCACCCATAG CAGGGTACAC GGCATCCTTT CCAGGCAGGC
TGCTGTCCTT TGCTCAGCGC CGGTTCCTCT CTTAAAGAAA AAACAGGGTT TGTGCcGTCT 
TCGCGAAAAT ACACAGCCGA TGCGTCTTCC GCCCAGAGTA CTCCAAAAGA AGGAACGCGC 
CGTCCGTCTT TTATGTGATC ATGCGCATCT AGCGCACGAA ATACATTTGC TTTATCAAAG 
CGATATTGTT CAAACGAACC ATGAAATTCT AAATGTTCAT GGCGTACGTT CATACATACT
GCCACATCAA ATGCAACATC CTGCAAACGT GCCGTACGTG TGGAAAGCCC GTGGGACGAC 
GCTTCAATTA CTGCAAATTC ACAGCTGTGC TCCCGCATCT CAGCGAGGAG CCGCTGTACT 
GTTAGCGACT CCGGTGTGGT TTGATGCTCT GCGTTCGGGA GAATATCATC TCCTAACGAA 
TACTCCACAG TAGAGATAAA ACCAACTCGT TTACCACATA AACGCAAAAG CtGCGCAATG 
AAACTAACCG TGCTGCTTTT ACCCTCCGTG CCAGTGACCC CGATAACTGT CAAAGCACGC 
GTAGGAAAAT CGTAGAAAGC TGCAGCAGCA CTAGAAAGCG CACACCGTGC ATCTGGTACA 
CGAGCATAGT ACACGCCGAC GACATACGTA TCTAATGGAC AATCATGCAC AATTGCGCAG 
GCGCCGGCAT CAATTGCTGC GTGGATGTAC TGCGCGCCGT GCGCATGCGT ACCACGCAAC 
GCAAAAAAAA CCGAACCCTC ACGCACTGCG CGTGAATCAT ACGCTATGGA AGAAACGTCC 
GccACACTAC CGTGCGTTTC TTGCACAGAA CAGGAGGCAA GACAGACAGT AATGGGTTTA
CGGTACAGCA TCGCGGGTCT GATTGTATCT GATTGCACGC CCTCGGGGAA CAATCTATTG 
TCAATGCTTT TTCAAAGAAG ATCGCAAACG GTGGGGAAAG GCCATCTCGT TGACAGCCTT 
TTAGTGATTA ACTTACACTC CGCCGCATGA AAATTTGGCT CAAATTTTTT GTCGGCAGTT
GCATTGGTGC ACTGGTAGCC TACACTATCC CAGAAACGCT CAGCGCGCCG CTCATGCAGA
CCATTTCAGA ATTGGTTGTA TCCGCTGGGA GTTACATGCT TTATCCAGTT ATTTTTTTTG 
GATTCAGTGT CAGTATTTTT GAGATGCGTC GAGAACGCCT ACTCCTGCGT ACTACCCTTA 
TCAGCATAGG TGCATGTGTT GCCACCGCAT TTAGCCTTTC TTTGGTAGGA CTATTCTCGG 
TACTCGTGTA CCGACcTGcG CGTATTCCCA TTTTTGCCAC CGGCACGCCG CAGAATCCAG 
GGTTTCAAAT CCGCACCTTT TTTTTGCAAT TGTTCCCTGC AAGTAGTTTT GAAGTATTCA 
CAAATGGTGA TTATCTTCTC CCtCTCTGcG TATTTGcCAG TTTCGTCGGC GCCGGCTGCG 
CAGTCGATCA TGTCGCGGCA AAACCCGTAC TCGCGCTTTT TGAGTCACTA ACGCGCGTCG 
CACACACCGT GATGGTCTTC TTCGTAGACA TGCTGTCTAT TGGATTTATT GCACTTTCTG 
CGCACTGGCT GTTTAGGTTT CGACCACTCC TTTCTACTGG GGTGTTCACT GACCTTGTAA
TCCTACTGAC ACTGACAGCA ATTTTTATCT GCAGCGTGCT CTATCCTGCC CTTATTAAAA 
TTATGTGCCC TGAAGTCAAT CCGTATCGAG TACTGTATGC AGCATTGGCA CCAATGAGCA 
CGGCGTTCTT TTCGCAAAAC GTGCACGCGA CGCTCCCTGT CTTGCTCTAT CACGCAGAGG 
AAAGTATAGG GGTGCAACGC ACAACTGCAA CGGTGCTGCT CTCTATCTTT TCGATCTTTG 
GCAGGGCCGG GTCAGCGTGC GCAATCACGA TGAGCTTTGC CCTAATATTA AAGTCATATT 
CCCATTTGGG AATCGGcTTC TTCGATGcGc TGTGGATTAT AACTGcTGcA TCATTtCTCT 
CCATTTTCTT AGGACGCTTT CCCACAGGAG GGGTCCTTAT TGCGCTTGCG TCAATATGCG 
CGTGGTACGG ACGAGGTTTT GGAAGCGGAT ACCTTGTCAT CCGCCCTGCT GCATTTTTTG 
TTGGAAGCAT CGCCACAACG CTGGATACCC TAAACGCCCT CATCTGCACC GCAATAAGCG 
CAGAACGAAT TGGAACTGTG CGCCACCGCG CGGTGCGTTT CTTTATCTGA GCTCTAGTGA 
TTGCACTGCA ATAGCAGGAG ACTCCAGCAT TCGATGCGCC CACGCCCCCA TACGCGCAAC 
CGCTATGTGC GCACACGGCG ACACACTTTC CATCGACTGA AACAGGTGCC ACATCCCCGG 
CCACACGTCT AGGGTCACTT GTACCCCCGC CCCTTCAAGT ATCTGCGCAA GCGCACATGC 
GTCTGTGTGG AACAACTCTT GCTCCCCACA TTGCACAAAC ACCGGAGGaA ATCTCCAAAA
TTCCCAAAAA GGGGGGAAAC CAGTGAATTG CGAAAATTAT CCGCGTACGT GTACTGCAAC 
GCACAGTAGC GGAACATATC GCGCGTCAAC AGGAGTTCTT TCTTCTTAAC TCCCTCCCCT 
GCAAACCGAT CCTCAGTTAA ATCAACCCAA GGAGAAATAA GcgCCAAAGC GCGCGGnACA 
CACCAGCCCC TTCTGTTTTA AATAGTGCGT CAGTGCAAGC ATCAACCCTG CACCTGCTCC 
ATCCCCACTG AAGATAATAT CTTCAGGACG AAATTTCTTC TGATCAATAA GTGCTACATA 
CGcATCATAC ATATTTTCTA GTGCAGCAGG AAAAgGATGC TCGGGCGCAA GTGgATATGC 
AGGArTATAA AACTTCGCGC CGACTTCATC CGCTAAAGAT GCACAGAGCG CACGAGAAGC 
CATAGGAGAA CCACTTATAA AAAATCCACC ATGTGCGTAC AACACTGCAT GGCCAAcCAT 
TAGGAAmCGC GGGCTCAACA CATCTGTTTC GATATTAGCC AATACCTCAC AGGAAACATC 
CACCCCATTG GGCACATACG GCATATAAAA AAAGTCATCA TACCCCGCAC GCAAGGCAGA 
AACCGATGTA CGCGGGGTAA AGCGCATCTT CTTAAACAGT TTTTTCGCCA TTCGGTGCAC 
GTGAGCGCGC GAAGGTCCCA TGCGTTTATC GTAACACAAA GAAACTGGTT CTGTCAGGCA 
ACCGCTTCAC GCCCATAAGA GCGCTTGACA CGTCCGGCGC ATTCCCGTTA CGCTCGGCCC 
CGGTTTCGTT AGGGCAATTA GCTCAGCTGG TTAGAGTACA AGCATGACAC GCTTGGGGTC 
ACTGGTTCGA TCCCAGTATT GCCCAGGAGC TCCTCTATTT CAGACCTGGC CCATTTTTTC 5220
TGTTTTTGGC AAAGCGGGGG GACGATAGCG GGTCGCCCGT CTCTTTTCCC CCTCCCTTTG 5280
AGGGGACCTC CCCGCTCGTA GAgGGGACGG GGTGCTCTCG CTCAACCAGG AGCCGATCGA 5340
TGCGCCTACC GTCCATTTTC AAGATAGTAA ATCGGCATCC GTTCGCATTT AACTGTTCTT 5400
TTACACGCGG GATTCGATTG CGTATGCTCA GCACATAACC GGCGATCGTG TGCACACCCG 5460
TGTGTGGCCG CGTCGTCCGC GCCAGAACCC CGAGACGATA CATTTCGTTT AAATTCATCC 5520
ATCCGCTAAC GATCCAGCTA CCGTCAGGTT CTGAAAACAA TCCTCACTAT TGATCCCGTC 5580
GGCACGCGCA TACTCCTGAA CGCAACGCAT AATCAACTCA TCACGCGTTA CCATTCCCTC 5640
AATCCCTCCG TACTCGTCAA TTACAAACGC CATCTGAGCC TGCATCTGTT GAAAAAGATG 5700
CAACAGCTTA CGCACACTTA TTACCTCAGG TACAAAAATC GGCTGCTGCA CTATACGCGC 5760
TATCACCGGA TCTGCGATCC ACACCCCCTG TCCTTTTGCA TGAGCGTGCC GCCGACCCTC
CATGTGAACA TGCAATGCGC TTGCAACAGA CTCTTGCTCA TCCTGCTCAC GAGAGGCTCC 5880
CGCGCTCCGT TCCATTTCAA GACATACGCG CACATACCGT TGCACAGAAA AATAGCCCAC 5940
GcTGCGTCAA TCGTACGAGC ACACACAGGA AAATAGTTGT AATCGTCATG CTGCGAGATC 6000
ATCGAGAAAA TATGAGAAGG GAGTGCGCGC GCCTCGACCC ATACAATATC GGTACGGTGC
ACCATGTAGC TTCCAATTCC CTGATCGAAA CACACAGCAC AATCaGCGTA CGACATCTGC 6120
aGCATAGAGG CGCGTGCcGT TGCCGCTGCC AGCTTTTCCC CATAAGCaAA GAAAAAATAT 6180
TCGCTGCCCA ATCCCACTTC ATACGCTGGC CCACAAACCC AATTGTAATA AAAATTTCAT 6240
ACGGCTGCGA CTCACACTTC TCCACCTCGC ACCAAACGCT ATTTTAAAAG GCGCGCTGCC
TTTCAAAAGG AGAGTGCAGC GAGCACAAAG CAGCCGACCA CGCGTTTTGG ATACCCGTGT
GAACCTCGCA TGCAACGCAC TGCCGGGTAA GACATATGCA ACCAACTCTT CGTTATTTTC 6420
AAGAACTACC TACAGTTTAC ACGCGTGGCA CCGTTTCCTC AAAAAGATAT CTATGCCCCT 6480
GCCGCCGCGT CCGGTAGAAC ACCGTGCGTG TGTGTGCGCT ATTCATTACG TGCAACAGAG 6540
CAAATCCTGC GTGCGAGCTT CCTCGCGCAT GCTCGAGACT ACCCGGATTG ATTATAAGCA 6600
CGCGGCTACT TACACACGCC CTTTGCACGT GGGTATGTCC ATGCACCGCA ATACTACAAC 6660
ACGCCTGCAG TGCTTGGGAA ACGAGCACAC TATCGCTCAC ATTCACCGAA TGGGTGTGAC 6720
CGTGCGCTAG AAAGAGCGTA ACATCTGCCA CCTGGATACG TCCACATAAG GGTATGTGCG 6780
AAGCACGATC ACAATTTCCT GCAACCATAA AAATAACACC TGGAATACGC TCCCTCAGCA 6840
CACGATTACG TCGTCCCCTT TGGCAGAGAT ACAACACATC CCCAATCCCG TCCCCTGCAA
AAAGAAGGGC ATCTGCACAA GAACCAAACT GATCTACCAC CGCGGTCAAG GCCTCCGCGC
TGCCGTGCGT ATCGGAAACC AGCAACAAAC GCGCACAAGA CAGCATATGC AATGAGGCAA 
TCGACTCCCG ACCCCCTATC ACTCCCGGTG CAGTCATCTC AAGCGTATTC ACATTCTATC 
CCTTTCGTGT ACGCTCCTTC CCGGACTGCC ACCATACGAA CGCACAAATC TGAACGTTTC 
TACCCGTTTT GACAGCAACA CATGATTGTA GGCACGCACA CCCGTCTCCG GGTCTACGTA 
GCATTCCATA GCATAGTCGG AGGAAAAAAC ACCCTGTTCC CGAAGGGAAT CAGTGACATT 
CGCAAATGCC CCTCTGAACT CCTGCAGAGG ACTTGCATGT GCTCGACGCG CGACACCCCT 
TCTGCGCACG ATTTCACATG CTGCACGCAG GAGCGACTGG AGtCCGCTTC TGTATTTGAA
ACCTCCGCCA CTCCTTCTAG AGAATCTTCT TTTACCTGGA AATCCACGAG AATATCGCTA 
CCTGCCTGCA TAGGAAGCAC CTcTTcCTcA GGCAAAAGAG AAACGATAAG CGTTTGCGCA 
CGCTCGCGTT TTACCGCGAG CAAGTACTTA CCACCTCCTA CAGGCAATCC CAAACTGACA 
ACCGCATCAG GCCGCATTTC ATGCACGGCC TCGAGCACAA CCTTCATGCG CGGCAAGGCC 
GAGAGGCGCG TCAAATCCCC GTACTCCACC ACTCGCGCGC GCATGcgCCA GGcCTCGCAC 
ACCCGCTCGA CGCAATCGAC AAGATAAACG TGACCTGAGT ACTTTTTTCC AAAAACGAAC 
AGGACCTTCT GCTGAGAGGA AGGAGAGATC GGTACCACTC CAGACCTCTC CGTACTCTCC 
CGAACGCACG AACACGCCCC GAGAGAACAC CAACACACGc ACAAGAGACG CGCGAACTGT 
CCTGCACGGG CGCCCCTCCA ACCCCTGCAG AACTTCATTC AGCACACGGG GAGACGCTGA 
GCGCTCTCCT CGCCCACGAA AGACACCGTG CGCGCCCGGT CCCTAGAACG GACGGTCCGC 
AAGtACTTGT ACTCTTGCCA CTGTCTCCGC GCCTTCTCTT CTGCGCGCTT GAGCAGCATT 
TCAGCTCGCT CGGGATTTGC ATTCTTAAGC GTCTTGAACC GAACTTCTTT GTACATGAAA 
TCCGCAAGCT TAAAATCAGG TTCCTTACTG TCAAGCTGAA ATGGATTTTT TCCTTCCGCA 
ATGCGACGGG GATCGTAGCG GTACAACGGC CACAAACCAC ACGCGACGGC CTCTTTCTGA 
TTAATCATGC CCTTGGACAT ATCAATCCCG TGGCTAATAC AGTGGCTGTA GGCGACAATA 
AGCGATGGAC CATCATAACT TTCAGCCTCT CTAAACGCCT TGACCACTTG ACTCATGTTC 
GCTCCCATCG CGACACGTGC CACGTACACA TACCCATAGC TCATGGCCAT CAAACCAATA 
TCCTTTTTAC TGATCTCCTT CCCCGCCGCG GcAAACTTTG CGACGGCCCC GATAGGCGTG 
GCCTTCGACA TCTGaCCACC GGTGTTGGAA TACACCTCCG TATCCATAAC AAGGACGGTA 
ATATTGCGCC CAGAGGCCAA CACGTGATCT AGACCACCGT AGCCAATaTC ATAGGCCCAG 
CCGTCTCCCC CAAAAATCCA CACCGAGCGT TTGATAAGGT GGTCAACGAG AGAAAGCATT 
TCCTTTGCAA GGGGkTCAGt ACTCTCACTG AGCACTTtCT TAAGCTGATT AACGTAGgCA 8700

CGCTGCTCTT CCACTGCAAC ATCGTCCGCC TGCTGGTTAG AAAAAATACT CGCAAACAGA 8760

TCAGCCGCCA CCCcTTTTTC CTGCAAcTTG CGTCCAACcT CGCGGGCATA CTCTGcAAGT 8820 

TTGTCACTAG TCACGCGCAT TCCGAAGCCA AACTCTGcTG CGTCTTCGAA AAGAGAATTT 8880 

GACCAAGCGG GGCCGCGACC ATCAGGACGC GTCGTATAGG GGGTTGTAGG CAAATTTCCC 8940 

CCATATATCG AAGAACATCC GGTTGCATTC GCGATAATGG CGCGATCCCC AACAATCTGC 9000 

GTCATCAAGC GGATGTAGGG GGTCTCCCCG CAGCCTGGGC AGGCACCAGA GAACTCAAAA 9060 

AGAGGTCTTT TCATGGACGC CCCTTTTGGC AGACTCAAAT TGAGCTTCTT CGCCTCAGGA 9120 

TCGGGCAGTT TAACAAAGAA GGCCCAGTTC TCAGACTCCA CCGCACGGTG CTTGGAAAAA 9180 

CTTTCCATGT TGATAGCCTT ACGCGTAGGA TCAGCCTTAT TTTTTGCCGG ACATTGCTGC 9240 

ACGCACAGGC CACAACcTGT GCAGTCCTCT GGGGAAACCT GAATCGTAAA tTCGCCTCCC 9300 

CAAATTCCTT GCCTTTGTAG TCACAGGAAG CAAACTTAGA AGGCGCATGC TCGAGCTCCT 9360 

TACCATCGTA CGCTTTCATG CGGATAAcTG CGTGAGGACA CACCATAGCG CACTGACCAC 9420 

ACTGGATACA AACAGACGGA TCCCAAATGG GTATAGTCTC GGCTATACAG CGCTTCTCGT 9480 

AtGCGTGGTA CCAGTAGGAT AGGTACCATC CTCTGGTAGT GCGCTCACCC CAAGACTATC 9540 

CCCCTGATTG AGCGCAATAG TACCTAACAC GCTTTGCACA AACTCCGGAG CATCGGAACT 9600 

CATCGCAGGA CGACGCGTCA CCAAACTACC GGCAACTCCC GGATACTCCA CCAATCCCAC 9660 

CCCAGCGAGC GCCATATCGA TAGTGGTGAT GTTCCTCTGT ACAACCTCCC CACCCTTTTT 9720 

GCCGTAGgCc TtCTGTATAA ATTTCTTAAT CAGGTCAATC GCCTCAGCTT CCGGCAAGAT 9780 

ACCAAAAATT TTGAAAAAAG CCGTTTGCAT CACCACATTG ATACGTGTGC CCATCCCCGC 9840 

CTTCTGAGCG ATAGAAATCG CATCGATGAC GTAAAACTTC ACCTCCTTTT CAATGATCTG 9900 

ACGCTGGACT TCTATGGGTA TGTGATGCCA CACCTCATGC TCACTGTACG GCGCATTCAG 9960 

CAAAAAGGTC CCTCCACGCT TGAGCGTTTT GAGCATGTCA AAGGTTTCAA GGTACGTAAA 10020 

CTTATGACAC GCTACAAAAT CCGCCTGCGT AATGAGGTAG GGCTTACGGA TCTTCTGCTT 10080

TCCAAAACGC AAATGAGAAA TAGTAAAACC ACCAGACTTC TTGCTATCGT AGGCAAAGTA 10140 

AGCCTGCGCG TTATTATCCG TCGCCTCACC AATAATCTTA ATTGAATTTT TATTCGCGCC 10200 

TACTGTACCG TCCGAGCCCA GACCATAGAA CACCGCCTGA CACACATCTT GATCATCAAG 10260 

CTGAAAGTTC GGATCAAAGT CTACGCTGCT GAACGTAACA TCATCCTCTA TACCGACCGA 10320 

GAAGTTCGGG ATCTTCTTCC CACTGAGGTT ATCAAACACT CCTTTGGCCA TCGCGGGCGT 10380 
AAACTCCTTA GAACCCAGGC CATAGCGACC ACCGAGCACG AGAGGGTAAT GCGTAAACGG
ACACTTCTTC TGGCTCTGCA TCTGGCCGAT AGCGGTGCGC ACATCCTCAT AGAGAGGTTC
GCCCAGAGAA CCTGGCTCTT TCGTTCGATC GAGCACTGCA ATCGCCTGCA CCGTTTTGGG 
CAATGCATTG ACAAAACACT CTGCGCTGAA CGGGCGATAC AAGCGCACCT TGACTAGACC 
ACACTTTCCT CCCTGAGCAT TGAGCACATC AACTGTCTCT TCAACGGCCT CAGAGCCGGA 
GCCAATCATG ACAATCACCT TCTCTGCATC GGGTGCACCG TAGTAATCGA AAAGACGGTA 
cTGGCGTCCG GTAAGCGCCG CGTAGykCCA TAGCTTTTTG GACAATGGAG GGCGCAACCG 
CATAGTACCT ATTCACTGAT TCGCGGACCT GAAAATACAC ATCAGGATTC TGTGCTGTGC 
CGCGGACCAC TGGCTTTTCG GGAGTCAGTC CACGCATGCG GTGCGCATGG ACAAGTTCGT 
CGTCGATCAT AGCACGCATG ACGTCATAAG AGACTTCTTC AATTTTCTGA ATCTCATGAG 
AAGTCCTAAA ACCGTCAAAA AAATGAACAA AAGGCACGCG CGCCTCGAGC GTCGCAGCAT 
GAGCAATAAC TGCGGTGTCC ATGGCCTCCT GAACACTGTT GGAAGCAAGG AGCGCCCAAC 
CTGTCTGGCG GCACgCcATC ACGTCTTGAT GATCACCAAA GATAGAAAGA GAACTTGTGG 
cGACAGsrCG TGCAGCAACG TGAAAAACAG CGCTCGTAAG CTCCCCTGCG ATCTTATACA
TATTCGGGAT CATAAGCAGC AATCCCTGAG AAGCAGTAAA AGTAGAAGAG AGCGCCCCCG 
TCGTCAGTGC GCCATGAACA GCTCCCGAAG CGCCTGCCTC AGACTGAAGT TCTACAACGG 
TGGGAACGGT ACCCCAGATA TTTGTGCGCC CCCGTGCGGA ATATTCGTCT GCGATTTCTC
CCATAGGACT GGAGGGAGTG ATAGGGAAGA TAGCAATGAC CTCACTAAGC GCGTGAGCAA 
CGTGCCCCat GCGGTGTTAC CATCCATCAT GaCGAGGTTC TTCTCAGACA TACGACCGTC 
CTCTCTCTAT AAAGTATCAG GGCAACCGGG TGCAGGGAAA CACGCTCCAT ATCCGCCTCG 
ATCTCCCCGT GTCCCGGCTA TAGTAGCACA CCCCGCCTGA ATGTGCATCG GCTGCACGCG 
GGACTCACGC TTTTTTTCAA AAAACAAGCA TCACTTCTCC CTGTTCAGAA AAAAAGAACA 
CGCGCTTACT CCCCTGACAG CACACGTTCA AAAAGCACAT CAAGTTCTCG CTCTGAATAG
CGACGACACG CAAGAAAACT CTCCTTCACC GCCGCTGCCG TCTGTTCATC CAAGCTCAGA 
CTTTTTGTAT CCCGCAGATA GAGGACGAAA TCTTCAAAAA CCGAAAGCGG CAACCACGGA 
AATGCCACCT CAAAAATGCC ATACCAGTGA CCATACGCGC GCGCTTCTAC AGACGCAAAA 
GCCACCCCTT GCTCTTTTAG TGACCCACGC ATGTCCGGAT GCACGGTGAA TACCTTTTGG 
TTCAAGAGAT CACAGTACTC CTGATCCGAA AAAAGCTTAC CGGTTCTCCG GATAAGCCCC 
ATCTTTTCAA CCGCAGACAC ATCTGCAAGG ACAGAAGCAG TAACAAAGTT CAACCCCTGA 
TCCTGTACCT CATAGAGCAC ATACCGCGCG CGTTGCsACG CTCCACACTC TGCTTTAACT
CACCTTTTTC CAAAAGGGGA CGCAAAATAA CAGCGGGTTC CACAGCGCAT TGTCACACAA 
CCGACTGGAA AAATCCAGCT TGCCGCACCT CCTCCCCTGA GATGGTCTGG CAAAAGCGCT 
TATAAACTCA GATCAATTAC CGTCCCTGAG TCCTCCCCGT CCTCATAGCT CACCAGCCCA
CGCGCCGCAC GAAATTCATC GAGCTGCTGG CGCAAATCGC TGACCTGCGC ACGGAGGAGG 
GCTCGGTTTT TTTCATTCTG CGCATATACC TGCTCCTGTA CTCGAGACAG ATCACACCGT 
AACTGCGCAA sTCCTGCACA CCAGACGCCT GCTGCAGACA CAGCTGTTCC ATCGGAGCCA 
ACGCCTTTTG CACACGCTGA ACATCTCCAA GCAACGCTTC TTCAAGTTCC ACATGACGCA
ACACCGTCTC TGCGTCCTGC GCCTCAATGG CTACCTGTTG CTGCTGCAAC ACCACCAAAT
ACTGCGCAAA CTTTTCcCGC TGCTCCTGCA GCAGTGCCTT CAAGCGCTTG AGAGTCGCAA 
CCCGACGCGC TACCTCTTCG TCTGATACCC GCGCACCGTC CATGcACCCT ACCCCGCAAG 
GTTCACGTTA TATGAAGCAG GCGACGCAGA AGTAACGGAG GATGCAGGAC TCCTGACAAT 
TTGAAACCAC GCCTCGCGCA ACTGCCTCAT CATATcGCGC ACGGTcACCA ATTCATCAGC
CCGCTTcTGG aTATTCGCGT GAAAcAGCTG cTGGTTGAAA TACGCGTAAA TAGAAAGCAA
GTTCTGCGCT ATCTTCTCCC CTGCTTCCAT GTCCAACGAC ACGGAAAGCT CCGTAATTAT 
CTCTTGCGCT TTCAAAATAT GACGGTGCAC CCGCTCAATA TCAGAGGCGG GAATCTTTTG 
CACGTCCATA AGCTCAATCG CACACCCCAA CTGCTTAATC CCTTCGTCGT ACAACAGCAA 
AATAAGCTCA CCCTGACTCG CCGTCTTCAC ATCCACCTGT CGATACGCAC TCAGCGCGGG
ATCCTCATAC GCCATAGCAG CCTCCATCGT AGCAAAACAA TAGCGCCAAT ATCGACCACA 
ACACTAACCC GCCTTAAGTA CCGCCGCAGC CCCCTGCCTA TCCCCTGATA CCGTGCCAGA 
CTGGGAGTGT ACACTCTGAA AAACACTCCG CGCCTTTAAA TACAGGATAT CCCCCCGCAT 
CTGCACCTTC CCCAGCACCC GAATTGGTTT TTGCGGGTCT ACGGAAACCA CAAAATTACA 
AAATACCGGC ACAATCCCCT CAAGCTTTGT CAGTCGGTCA TACCCAACAA GCAGATTAAA 
ACTCACACTC CGCTCCGTCC GTTTGACGTT CGCCGCCATC CCTTGCCACA CCACCCAGCA 
ATCCAGGTAT AGCCCCTTTT GctCACGCAC CTGCACATAT GGATAATTAT CTTCGTCGGG 
AAACGTATCA AAGGTCGGAA GCGCAAAATA GTCCATGAGC ATACGTGCCC GGCGTTTAAT 
CGAATGCGAC GCATTGGAAG CAAGCAGCTT ATTTATTTCC ACCTGTGCTG CATTGTCCTT 
GAACGTCTGA AAGTATTTTT GCGCGTTGGC GTACGACTGC AAAACATCCT GCTTACTCAG 
CGTATACCGG TATGAACCGC TGCCTTCAAG CGGCCGTGCG TACTCCTCCT TCGTGAGCGT 
TAAAAACGAT ACGTCTGCAC GCGAAgTCCG CGCAGTCCCC CTCCACAAAC CTGAGTGCCA
CACACGCAAT CCTGCTATAC CCAACGCGCC GACTAACAGG GTACACAGGC ATCCCACCCC 
TACCGCGCAC ACTACAGATC GGCGCCGATA TTCGGCAACC CCAGGGCGTG GATACAAAGC 
CTTAACCTTT CCACTTTGAA TAAAACCAGC AAGCGACTCA GGCGTATgtG cGyTTtCAGC
GCGTTCAGAC CCTTCTGAGC AAGCCGATGT TTGGGATCCG CATCGAGCAC CGCGATATAA 
CGTTCAACCG CACGGTTCGT ATCACGCCGC TTGAGTGCCA GCGCTGCCTC CGCACACAAC 
AGCGTCGGAT GCGACATCTT AATCTGCCGC GCGCGACCCA AATACGAAAC CGCACCGGTT 
ACTATTCCCT CATGCAAACA CGCAAGCCCC AGGTATAAAT GAAAGACAAA GGATTCGTGA 
TAATCCAACA CGCAAGGCTC AAGCAGTTTG ATCACCTCGC CATACTTTTC CTGCGCAAAC 
TTCCTCTTCG CTCGATCCAG TACTGACAAC GCCATAAGTG CTCTGCTTTA CAAACATTAC 
CATAACTTGC ACGAACAACC ACTGTCGCGT GCAAGCACCA CGCAGCAAAA ACTAACCCCT 
ATGCATAGCA AGAAGCAGTA TCCCCGCAGA AAAACAAGCG CTCTAGTACC CTATGAGCGA 
CAGGGTCTCC TTACACCTGA ACACCTTACG CCCCTGAGCT GTCCTTGCAG AGAAGAGACC
TCAAGGTACC GGAAAAAAAA GCAGAGCCTG ACACTATCAC ATCCGCACCA GCGTCAagroG 
CCTGCGGCAA CGTGCGACAG TCGATGCCCC CATCAACCGA GATCATGTAC GAATACCCCC 
GTTCGGTGCG CATCTGCACA AGTGCTGACA CTTTAGAAAG GCAATGGGCA ATCATCTGCT 
GTCCGCTAAA ACCAGGATTA ACCGTCATTA CCAGCACTAG GTCCACGAAG GGCAGCACTT 
CACTGAGCGC AgmAACAGGA GTAGACGGCA CGAGGCTAAT ACCCACCTTC ACTCCCCGAC 
CACGAATGGC ATGGATAAGC CGGTGTGCAT GCACCTCCGC CTCTATGTGA AAAGTTAAGA 
AGTCCGCGCC CGCCTGCACA AAATCCTCAA TGAGGTCGGC AGGCCTACTG ACCATCAGGT 
GAACATCAAA CGGCAGGTGC GTTTTGCTAC GCAAACAACG CAGCACCGGA GCACCAAACG 
TCAGgTTTGG CACAAAGTGC CCATCCATAA CATCCAGGTG CACCCACTGT GCGCCGTGCs 
yTTCCAAATA CACCAGCGCC CTATCGAGCG CAGAGAAATC TGCACTTAAT AGTGAAGGTG 
CCAATGTAAA AGACCGCTCC ATAGGTCCAT GCTAGCAAAC AAATCGAGCA CCTGTAAATA 
TGGATACTGG CGTGAACAAC CCCAAACATT CAAGACAGTG TCCTGCTTAT CCTGTTTTAT 
TTGCGCCTAA ACCGATAGAA AAATATCCTT CCCGCGGGTA GGCTGACCCC GTCATGGTGA 
CTCCGAGCCA ACACGCGCTC CTTGCAGAAG GCGTAgcACA CCTAACGGAC GCGCGCACCG 
CACCTGCTCT TTGCGCTCTC CTGAAACAGT ATTTGGAAGA ACTTATTCTC TTTAACACGC 
GCGCACACCT GGTGCATGTA ACACACACAG AGGAACTTAT CACACACCAC CTATTAGACA 
GCCTCAGCGC CTGGCCACAT TTCACCAACG CGCGCGctAT CGCCGACATC GGATCTGGCG
CGGGCTTGCC AGGGATCCCG CTTGCCTGCG CACTCGCGTT GTATGCACCA GAAACAGAAC
TGACGCTCAT CGAGCGGCGT GAGAAACGCA TAGCATTCCT TGAAAATGCC TGCGCGCGTC
TGGCGCTCCC CCACCTTCGT ATCGTGCATG CGGACGCGCA CGACCTCACT CCCTACACGT
ATGACGCAAT CACCTTTCGC GCTCTGTGTC CCCTCAACCA CCCAACGGTA TATATGCTCC
TGAACAAACT GCGCCCTGGC GGCGTAATAC TCGCGTACAA AGGGAAGAGA AAACTCATCG
AACAGGAAAC GCGTGATTTC CTCCCACAGT CTTGCTCTGT CTTCCCCCTC CATGTTCCCT
TCCTCCACGA AGCACGGCAC CTCGTTGCCA TACACACACC CTGCGCAGCg cCTCCCCAGT
GACACAGGAG CACAGCAGAG AGGAGAACAT ACAAACGGCA TGCCCATTAT TTGGACGCTG
CGGATACCCG TTTTCGCAGC TGGTCGAAcT GCGCCTGATC GCAGACAAAA CgsTTCGCTG
CCGCACCACG GCGCACTATT TTGAGTGAGC GAATCGTATC TCCCGCTATA ATTGCATGCA
CCACTTCCAT ACCCTCAACG ACTTTGCCAA ACACGGTATG CTTTCCATCC AACCACGGCG
TAgcACGTGG GTAATAAAAA ACTGCGAACC GTTCGTTCCT GGTCCTGCAT TCGCCATTGA
CAACACTCCT GGGCTGTCGT GTCGCAACGC AGGATCACAT TCATCGGGGA ATTGATAGCC
AGGACCTCCC GTCCCATtTC CtGCGGGTCT CCCCCCTGGA TCATAAAATC TTTGATAACA
CGGTGAAAcG TTAACCCCTG ATAAAAAGGA CGACCcTTGC ACACCGCCAA CGTTCCCTCT
GCTAACCCCA CAAAATTACA CACCGTAAGC GGCGCCTTTT CAAAAAAAAG CGAGAGAACA
ATCGTTCCCC GATTTGTTTC CATTACCGCA TATATACCGT CAGCGACCGC CAACCCTTCC
TCGCGTACCA TTTTTTCCTC CGCACACCCG ATCCTGCCAA CGAAGCAGAA CAACATCACC
CCCACACAGA CGCGCCACAC tGCGTATTCA TATCGCACCC CCGGTGTGAC ACACCCGCAC
CTGCTTGTAA AGCCCTTCCC CCTCACTTCT GGTCTCTACC CCACTGATCC CATGCAGCGT
TGTATGAATA AGCCTCCGCT CATAGGGATT CATTGGCTCC AGCAACACCG ATCGCTTGCT
CGCACTTACC TGATCTGCAG TAGCATATGC CATGCGGATG AGCATTTCTT CCCGTCGGAT
GCGGTAATTC TCACAGTCAA GCACAACCTT CACCCCTTTA GCACCAATTT TGGTAAAAAA
TACATTCGCT AATAGTTGCA ACGCATCGAG GTTCTTCCCC TTCTTTCCAA TCAAAATTGC
AGAATGGCTT GAGTGCAATC TAACCACCAC CCGATCTGAC TCCCGTACGA AGGCATCGAT
TGTCACCGCG TAGCCCATTG CGTGGAATAC ATGAGATAAA AAAGCACGcA GctGCGCCCC
AAGATCTGCA AACTGCTGCT CACTCAACAC AGAAACCGGA TCGTACTGCG CATGCGCCGC
ATGACCCGGC GAGGCGGTGA CCGCCGCTAC ATGCACACGA ATACGCGCAA AACCCTTCTT
GAAAAGAGAG GATTTTTGTA TCTCCAGTAT CTCCACATCA AACTGATCTA CTCCCAATCC
CAGCTCTGCA GCGGCACGcg CAATGGCGTC CTGTTCTGTC TTGCCTTCAA ACTCATACAC 
CATACACCGA CTCCTGCATC AGGTTTTATT CTTGTTAGCC GTACGCTTCA TCACCAACTG 
CTGTACAAGC GTCACTCCGT TCATTGCCGT CCAATACACT AGAAGACCCG AGGGCGCATC 
ATAAAAGAAA AAGAAAAAGA ACAACGGCAT CACATAGGTC ATAATCGTCA TGGATGTTTT 
TTGCTGCTCT GTGTGCGGTA CtGCGTCAAC TTACTAAACA TAATTTGAGA GACTACATAC 
AAAACCGGCA GCATACGCAT TTGAGTCCAC TGTGTCACCG GCAATGCGAA CGGCAGTGTC 
CACACGCTGT CTGCCAACGA AAGATCAGGA ATCCAATACG GGATAAACAT CGCACCACGG 
AACTCGAAGT AGTTATTGAA TAACCGATAC ATCGCAAAAA TAATAGGCAT CTGTACAAGC 
GTTGGGAGAC AGCCTGAAAG CGGATTGTAC TGCGCTTCCC GGTAGAGTTT CGCCATTTCC 
TCATGTATCT TCTGCGTATT CCCTTTGTAC CGCTCTTGGA TACGCTGCAT GTGTGGCTGC 
AGTTCTTGCA TCTTTTGCAT AGCGATAAAG CTCCTTTTCG TCAGCGGGAA AAAGAGCACC
TTTATTGCAA TCGTCACCAA AATAATTGCC ACGCCCCAAT TAGGAATGAG GGTGTAAAAC 
AAACGCAGGA GCCACTTAAG GAGCACCTCA AGCGGATAGA GAATACCACC GCTTACTGCC 
ACCGCATCGA TATACGTGCG CTCAAGCCCA TAGGGATTTC GAGAGGCAAC GTTGTACGCA 
CTCAAATACT GCTCTGCGCA CGGGCCGATG TACACACGAT AAACATCTGC AACCGCAGsT
GCGCAACAGC GCGGCGCACA AACGCGATGT GATGCTGCAC AGCTGTTTCA GCTTGGGGAG 
CCGATAGCAC TAGTCTTTTC AGACTGTCCG CATCATTGGG CAAAACGATG AGCGCAAAGT 
ACTTACCCGA GACACTCGCC CAAGAGACAG GCGTATCTAC CTGTTCAcGT CCATCTCCTT 
TCAGAGCATA CGTTTTCGCC TTGCCACCTG CACCTACCAT GAAGGTGCGA AACTCATACT 
TGTCCGCCGC ATTCCGCTCA GGCCCGATCT CAGGCGGTGT GCGCAGGGTA TAACTTGCTG 
TCCCAAAGTC AAAGCCATTC GCCCGCGCAA GAACAGTAGT CGCTGGGCCA GCATTCGTCT 
CCGCCACCGC GCGCACCTTC GCCCCTTGCC GGCTGTCTTC CCGCTCCTCT AGAACGTCTG 
CACTCAGGGA AACGTGCAAC TCAAACATAT AATTATCAGG ATAGAATACG TAaTCsTTCG 
CCAGCACGAA AGGAGTCCGT GTACCGTCAG CATGCTGCGT GGCAACACTG CGGTAAAAAC 
CTATGGAATG CAcGCCACGC GCGCCTATCT CCTGCTTTAC TTGGAAAAGT GCATTCACAT 
TCGGAGCATA CTCGTCCCCC AGCGCGAGCG AGAAAGCTCG ATGGTCTGGA CGTGCCTGTT
CCACCATTTC TACGTACTCA CGTCGTTGCG CCGCATAGTG tCACGCAACT GATACGAAAG 
AATATCTCCA CCACGATTGG TAAACGTCAC TTGCACTAGC GGGGTGCGTA CCACATACGT 
GCGCTCTACA CGCTCTTCAG TTTCTGATTC GGGAAACACC AGCACCTGGC CGGAAGGATG
CgCaGGCTGC GTGGTCTCTT GCGTATCTGC TGCACCCCCG TGAGCACTTT GAGTGCGCGT 
CTCCTCAGGT ACGGCCGACA GGGTGTGTTC CGCAGCAGAG TGCGAGGAGG ATGCAGACGG 
ATACAAAAGT TCCTGAAGGA ACGAAGCTCC CACAAGCACT AACACCGACA ATACGACTGC 
AATAAACATA TTTTTTTTCA TCACGAACAC TCCTGCGCGT CTGTTTGCAA TGGGAGTACG 
TCCTTTGTTG GTACTGGGTC GTAACCACCT CGAGCAAAAG GATGACAACG TACAACTCTC 
TTTACAGTAA GAAAAAGACC CCATACACAT CCATGTACTC GCACGCTCTC ATAGGCATAC 
TGCGAGCAAC TCGGATAATA ACGGCACGAA GGCAAAAGAT GAGGGGAAAG AGCGCGCTGG 
TAAAAACGGA TCAGCGCGAG CAACAATCCA CTAAATACCC ATGcACACAA GCGACGCACA 
CTCACGGCCT ATTCCTGCCT TTCAGCGGTA AATTCAGAGA ACAGACGCGC GCGATCACAC
AACACGCACA GCAGCCGCTG ATACGCCGCA AGGTGtCTTC CACAACGGAA ACCAAGAGGA 
CCAAATCAAA ACCTGGAACG AGAGAACTTT TGAGTGCACG ATACGCCTCT TTTGAArGCC 
GTCGTGCGCG ATTGCGCGCG ACGcTTTTCC ATAGCCGCGC CGAAAAGTAG CAAGGAACCG 
ACTATACGCA CACCCGTTTG GCAACACAAA AAGACACGCG CGCCCGTAGC AAAACCTACG
TCCCTGTTTG AACACTGCAC GCACCCGGCA GGAACCGCGC AACCGTTCGC AGGCATCAAA 
CGTAACGCGG ACGGTGAAAC AACGCGAAGA AACGCGTGAG GCAGCCACAA GGCCTAGTAA 
GGTTTTTTCT CGTCAGAAAC GGTGAGCTTG CGTCGTCCcT TCGCACGACG ACGCCGGAGT 
ATCGCGCGCC CACCACGCGT TGCCATGCGC GCGCGAAAGC CAAATTTCCG cACGCGTTTC 
CGCCTGCTTG GTTGATAGGT CCGCTTCACC GATTTCTCCT CCCCCTGCTA AGAACACGAG 
GGTCCACTCT GTACCAAAGG TACCGGAGGC GTCAAGATCC CCTCTTGTAG GGGGAACCGG
CTCCTCGGGA GGAGCCGCGT CAGCACCTTT AATTCGAAAT GTTTTGCTCA CAGAAAAAGG 
GATAGGAGTG TGCGCGCCCT GTTTCTTGCG ATAAGTGAGG CTTTAAATTC CGGAACGGTC 
TCCCATGTTC ATTAAATAAG GCGCGATATT CCTCTTCTTC CCGCAAAAAA AGGCGAAACC 
AGGCTGATAT CAGGCGACGG TACAACGCGA ACGATTCCAC ACTCAGCGTT TTGTTGTACG 
TAAATCCCCA CACTTCAAAA TATTTATCCG TCCCCGCCCC ATGTCCCATA CCTTTTAGCG 
ACAAAAAGCA CGCAGGCACA TCCTGTGGTA AACTCTGAAA GAACTCATAC GACCGTTCAG 
GATATGCCAA CATATCCTTT GTTCCCGTCA GAATCAGCAC CGCAGCACGT ACCGAGCGCA 
AATCAGTTCC CAGTTGCTCG TTTTTTCCCC CAACCATGTT CACCATATCA GCGCCCCCGT 
TAAAGGGATG CAACGCAACT ACCGTTCGAA TACGCTCAGG GTACCGATTT GCATAGTGCA 
GGGCAGCTGT ACCACCCATC GAGTGTCCCA ACACCCCTAC ACGCGTAAAA TCAACCTTTC
GATACAGAGG AGAGCCCTCC TGCTCATTTA CGCGCTGCAT AAGTGCATAG ACACTATCAA 
ACGTACTCAA AAAATCGGGT GGCCGcCGCT GAGCTCGCGA CGTAAACACC ACCGTGACAA 
ATCCCTGTGC AGCCAGAAAG CGCGcTAACG CACGCTGATA ATCTTGCGTG CTATTCCATC
CGCGCGAAAG CATGATAAGC GGATACGTAC CCTGATGCGC TGGATAATAC ACCGAGGCAG 
GATAACGCAG GTGCACTCGG TCAGTGAGTT GGTCATCAAA CTGATTATCA CGCACAGTAC 
GCTTGTGCAT TTTATTCAGA AGGGAGACAT CATCGTTACA CGCCGCCACT TCGAAGACTT 
CTTGGAAAGT AAGATCCTGC GCACGCAAGA AAAACGAGAA GAAAATACTA AATGCCAACC 
CACACCATTT TTTCATCACA CACCCGCACA GAAACAAGCA TAGCAACGCC GCCGCTTACA 
CGCCCGCTCC GTTTGCCCGT GACTGCTCCA TCACTTTACA CGTGCTGCAG CrTTCCTTAC
CGTCACTTAG CGAAACAGAT TCATCAGCAG CATCGGCACT GAAATTAAAC TGCCAATCAT 
TGCCCCTATG CTTGAAAGCA CCACCACTAA CAGTACATGC GCAAACTTAT TACGATACCA 
GCCACGGAAC GTAACAATAT CGTCCGCCAG GCGCTCCATA TCCGCCACTT GCGGCCGGCA 
CACCCACGCC TGCGCAAAAC CCGTAAACAA ACCCACCCCG ATTATCGGCG TGAGTACTGC
AATTGGCGCC CCCACAAAAC CCACCACTAT GCTGAGCGGA TGTCCGAGTG CACACAACAC 
CCCCAGTGCT GTCATACTCC CGCTCCACCA TAGCCACTGC ATCAGTGCAT CGCGCGATGC 
GCCTaCGCCG CCAGCAAAAA AGCATGTCAC CACTACCCCC ATCAGCAGCA ACGGAAACAG 
CCAACCTAAC AGCTGCCCCG CATGCGGAGA ACTACCCACC GTTTCTAGAT TCGTCACTTC 
AGCCGTGCGT GCTCCGCAnA AaAACTCGTA CAAACACCGT TGCACACCCG CCACGCTCCC 
TGcGcTGACT ACTGCCATTA CCACTTGGCT GTCCACCGCC CAAATTTTGG AAGCCAGATA
TTGGTTCCGC TCGTCCACCA ATACCCCCTT TACCGCCGGT AGGAACGAGA TAATCTCTTG 
CATTAACCCA TCCATCGCGC CGTGCGAACA CAGTACCGCA ACGgCTGsTC GSTyAACTGT
TcCCCAGTAA ACGCCACACT CAGCAACACT GCCAGCAGCT TTGCTCGGCC CCAGGGATTT 
AACACGCGCC AGGCACGTCT GAGCGTCACC TCAATAGACC GATCGATATA CGCTACCTGG 
GCAGAAAGtC CGCTGCCACC TCAATTGCCG CTTTCACTTC ATCCCCAAGT CTTACGCCAG 
TACCGGAACT CAGACGTTTC TGAAACGCAG ACAGGGCTAG AGTACTGAGG AGAAAGAAGC 
CTTTCCCTTC ACGCAAAACC CGCGCAAGGT CATATTCCTG CCACTGGCGC ATCCCCAAGA 
GGTCCcTGCG CACGCGCATC GTCCACTTCC ACACACACAC ACTGCGGACG CCTCGCACGA 
ATnGTGCGAC GCACGCACTC AATGGCTTCC TGAGAAGTAC AGGCTACCCC CATAAGCACC 
ACACGGCGAG ACCCGAACTC AAGACACGTC AGCTGGTGAT CCATAACCAC TTTCAAAGGG
TAACCTTTAC CCGAGACTCA CGCCATCAAC CAGTCGCACC TTGCCGACTG TCCCTCCCAC 
ACACGGTGGC GCCCAAGACG GCCTAGGGAT TCGTACGCAG CGACGATGAC CTCCCTCGTA 
AACGCCCAAG AGCACTCACC GCAAGTCGGT GCTCAAAGGA AGAAGGAGCA CGCGCCGATT 
CTACTTCACC GTAATGTTTC TCGGCAAGCA GATGCCGCCC CCCGAGTTCG TAAAAAAAGG 
CAAGGTAAAA CGAGTATCTT ATACGCTGGG GCACTGACTT AATTTTAGAT ATACGGGAAG 
CCATATCCTG TTCCCCACTC AAATCGACAA ACAAGCGACA CAAAAAATAC TCCACTTCCC 
GTCGCGTTCT ATCTACGGTC CGAATAAACG TCCGCATAAA GTGCTGCGCC TTGTGTGCCT 
GGCCCATCTG ACACAAACAC AGCGCCGTCA TGAGCGCATA CGAGATATTC GTTGGGGCGT 
AGGTAAGCGC AGTTGCAAAA GCCTCACGCG CCTCTTCCCA CCGCTGCTGC TCCCAAAATA 
GCACTCCTAA GCTTTCGAAA GAAAAATGGT ATTTCGGATA CAGCTGCACC GCGCGCTGAT 
AGTGCTCAAT CGCTTTCTCC GCTCGCCCGA GCTCGTCGTA AATTCCCCCC AGATAAATGT
GGGCAAAATA CGCATCAGCA GAAAGCGCCA CTGCGCGCTC GAACGCCGCT GCCGCACGCT 
CCTTTTTTCC CGCCTGGGAC AAATACGTGC CGAGATCTGT CCAATATGCA GGATCATGCG 
GATCAAGCTG CACCACACGT TCCAGATCCC TAATAGCTTC GAGCACTCGG TTCGTTTCCG 
CTTTCACCCG CGCACATTCT GCAAGCGCGC GCTCATGTTC AGGCGTATCC TGCAGTACCT 
GTCGGTACTG TGCCTCTGCC TCTTGCATCT TTCCCTGCAG ATAGTACACC TTCCCCAAAC 
CTACCCGCGC GTCCTGCGCA CGCGGcTCCA CGCGCAGCGC GCGnAGAAAG CCTGcACCGC 
CTGGGCATAA TCATTAACAC TTAAAAAATC GTAACCGCGT TCAGTCAGTG CCCACAGATC 
GTGCGGaTCC TGCGCAAGAA TCTTTTCTAC ATACTGTTTC TTCTTGCGCA CATCTCGTTT 
CGCTTGCGCT ATCATTGCGT GTGCGTACCA CAGTTGCACC GTTTCAGAAG CCGTAGgACC 
GTCCTGCCCA AGCTTTTCTG CAAGCTCCTG CGCGTGCGTC AATTTCCcTG CGGAAATGAG 
CGTAGAAAGA TACAGGTATT GGATGCGCTT TTCGGCACGA TGCTCAGGGC TCAACGTATC 
GAACAGCTGC AACGCCTCTT CCCACCGCTG CTTTTCCAAC AACCCACTGA GCTGAGCTGC 
AAAACGGATA TTAGGATTTT CGCTTTTTAA AAGCTGTGCC CGCTCGCTCT CCCGAGCGCT 
GTTCGATCCA GTACTAACAC AAGATACAAA GAGCGCACCT AAGAGCACAC TCACCTTAAA
AAGAGTGCCA TACCGCACAC CGACCCCCTG TACCGCACGC TGCCGACACC CGCGCGTCGC 
TGAATTCTCG GCGTCTTTCT TTTACCTTTT TATTTAAATC GCAGAGGAAC TTTCCTGGAG 
CTCCCCCACA AGAGCCATGC CCTTGACCTG AGAGGGAAAA TCAGCTTATG CTTCCsCCGA 
TGGCGTGCGG GGAGTCCTGG GAAGAGCTCA AATATCGTGA AGTTTTTGAG GAAGAATTGA
GCGCGCTCGA GCACCGTCGC CAGCGCGATC CAACGTGCAG CGTCTCGGAT ATCGAGGCGG 
TTCTGGAAAC TCTTTATCTC ATGGATGGGA ATAATCAGGA TGGGCGTGGG AACCCCCGTC 
AAATCGGTCT GGACGCTACC ATAGCCGCGT ACGAGCAGTT TCTGTGCGAG TGGAGACGCC 
AGCTGAGCAC TGCCTCGCCC CTGAGCATGG AAAAGAAATG AAACACCCCT CGGTGCGCGT 
ATGCTGCTTT GCGTTCGCAT CCTGTCTTCT TTGTGCAGGC TGTTCACTGA AAAGGCTCGC 
CTTTTCCTCT CTCTCCCACA CGCTCGCTCC CTTTCCTGAG GGGGAACTGG ACGCGCACCT 
TTCGGACGCC GATTTTACGC GCGTTTTCAC CGAGGAAGAT GATCTTGATT TAGTCGCCCA 
GTCCCTCCCA cTGGTGCTCA AGGTGTACGA AGCGCTGCAT CTGCAGAATC CCGCGCACAG 
AGGACTATCC CTCGCTGTCG GCAGGCTCTA TATCATGTAC GCTAATGCTT TTGTCCAGAC 
CCCTGCTCAG TATTTGCCAG AAGACGAGTT TGAGGCGCAG AACGAAGCCT ATTCGCGCGC 
GAGGAAACTG TATTTGCGTG GCGCGCGCTA TGCGCTCTCC TCGCTAGAAA CCGCATATCC 
GGGCTTCACC CGTGAGGTAT TCTCCGGGGA TGAGCAACGG TTGCACAAGG TACTTTCTCG 
CTGTACGCGT GTGGATGTGG GCACCCTTTA CTGGGTAGGT ACGGGGTACG TGGCGGCGTT 
CGCCCTTACC CCTCTGGGAA GCGCGCTCCC AGACACCGTG CATGCGGCGG TGATGATGCT 
TGAGAGAGCC TGCGATCTGT GGCCTTCGTA TCAGGAAGGA GCAGTCTGGA ACGTACTGAC 
CAAGTTTTAC GCCGCAGCAC CAGAGTCTTT CGGTGGGGGG ATGGAGAAGG CACATACCGC 
GTTCGAACAC CTTACGCGGT ACTGCAGCGC GCACGACCCT GATCACCACA TCACATACGC 
TGATGCGCTG TGCATACCCC TTAACAATCG TGCAGGTTTT GACGAGGCAC TCGATCGCGC 
TCTTGCCATT GACCCTGAGT CGGTGCCGCA TAATAAACTA CTGGTGATCC TTTCTCAAAA 
GCGTGCACGT TGGTTAAAGG CGCACGTGCA GGATTTTTTC TTGGATTGAG AATAAGCAGA 
ATTCGTGGTG CAGGTAGTCT CCCTGCACAG GACGCGCGTT CTTGTGTAAA AAATTACTTT 
TTGCAAAAGG AATATCTGTA TGCGAACGTA CTTTTTCATG AGTGTCTGCT CGGTACTCAC 
CTGTTTTGGC CTCTATGCAA AAGAAAAAGT GGTGTTGAAG ATCGCTTCCA TTGCCCCTGC 
ACGCTCCATC TGGGAAACAG AGCTGAAAAA GCTTTCAGCA GAATGGAGTG AAATTACTGG 
CGGTCTGGTG TCCATGAAGT TTTATGACAT GAGTTCGCTC GGAGGAGAAC GAGAGGGAAT 
TAGAAAATTA AAATCCAGTC GTCCTGGTCA GGCAGCTCCT CTTGATGGAG CTGTTTTCAG 
TTGTTTAGGT CTGAGCGAAC TCGCGCCAGA TTCCGGTATC TATACGCTCT CGGTCCCCTT 
TCTCATTCAA AATGAGAAAG ATTTAGAACG AGTTCTGCAT GAGCTGCGCG AAGATTTAGA 
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CAGACCCTTC CGCGCAcAGG TTTTCGCGTC ATCACGTGGA CGAACGCCGG 
TTTTACACAC GCGCGCCGTA CGCATCGTTA GGACAATTAA AAAAACAGAC 
TCCAGCCTAG ACAGCTCGGT CCTCGGTACC TGTTTTAGAA TATGCGGTTT 
GATGCACCGA ACGCGCGCCT TGCACCGTTA CTGAAAGCAG GTAGCATCGA 
TCAGTGCATT TGTTCACCTG GGCAACCGGT TTTTACCGGT ACATTTCGTA 
ACTAAGATTT GTCcTGCGGT AATCGGTATG CTCATCTCAG ACGGGTCATG 
CCATCGCGCT ACCACGACGC TATGCTCCAG GCAGCTACAC GCGTAAGACA 
AATAACCTTG AGACACTTGA TCGCGAATGC AGCAACAATA TACAGAAAGC 
ATCGTCCATC TTACCCCGCA GnAAATACAG GAATGGCGTA CCGAGTTCGC 
AAGCGCATCC AGGCGCGCTT ACCTGGCATG TTGAACATGA CTTTGTACGA 
CACCTCTTGT ACAGCGCACA GCgcwgAgcT TAGCCGGTAT AAGAGGGAAG 
GAAGGGTACA CGGGGACAAC TGGTTTTGCG CAgcaTAGcG CTTCTGCTCA 
CATGCTGCTG CCGTTAGTGC TTTTTTTAAT TGAACGGATA TTCGGTTTTC 
CGTAGGTTCC GAGGTGTTCT CCGCGCACGA GGACTTCATT TTCCTTTTTT 
TGACGCCGCG GTTGCACAGT TAGCCTTCGT GTTTTCCTGT GTTGCAGGCA 
CGCGTGAACG TAAACACTTG AGTGTCACCC TGTTCTCGTG CGACGTGGAC 
ACCGCGTTCT TTCCTTCCTC TCTGCGATCT GTACGGTGGC AGTGCTCAGC 
TTGCGTCTGG ACCGAATATC GTCGCAGTTT TTCGCAAAGA AGAAGCTGTG
CGTTACGCTG GATTTTTACC GCGCTGCCAT GCATGTACGG CGCGCTTCTT 
CACGAGAAGT CAAGTGTCGT ACGTGCGTCA TCGTTGGACT TTTAGTTGGC 
GCACAGGATC CATCGCCTCT GTGCTTTTCC ATCTCTTTGA CCTGACCGTA 
ATAGTGTCTT TCACGGCTGG GTAGCAGTGG GTACACGACT CTTTTGGCCG 
TCCtTCTTCT GCTCGCTGCA CAGGGTCTCC CGCTTTTTAT TACGCTGCTT 
ATCTGGCGCT GAGCGTCGAT GGAGGATACG TGGATACCCT TCCTCTCGAG 
TCcTCACGGA TACGGGAGGA ATCGTAGCGG TTCCGCTTTT TGCCACTGCA 
TTGCACGCGG CAGTACTGGA ACGCGTnCTg cTTCGCTTGG TAAAAGAAGC 
CTTCGTGGAG GAGCAGCAGT TGCCTGCGTG GCAGTAGCGG CGCTGTTTAC 
GGTGTATCGG GGGTGACAAT CTTGGCCCTA GGAAGCTTAT TCAAGCTGAT 
AACAAATACC CCGAGCACGA TGCAGAAGCG CTCATTACCT CCTCTGGCGC 



TTGGCTTTCT 
TATCGCCCTT 
TGACATCAAA 
CGGTTTTCTT 
CGCGCTCGAC 
GGCGCGAATC 
GCGCCTAGCT 
CGGGGTCTCC 
TGCAGACGTC 
GAAGATCAAA 
GCGATGTCAT 
TTGGGACGCT 
TTACGCGGGG 
TCTCCTCCTC 
TTTtACgctG 
AGACCGATGC
GCTTGCTTTT 
TGGGGAGTGC 
TTTCACTACG 
GTGCTGATAA 
CCCCTGCTGG 
TTCGTGCTTC 
GCCATCGCGT 
GGGTACAAGA 
AGTCTGCTGC 
GGTGGGCTGG 
GTCATTAACC 
TCTCACGGGT 
CATCGGACTC 
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CTATTTCCAC CCAGTGCAGC GATTATCATT TTTGGCGCAA CTAACATTCT TACCGTACAT 
ATTGTGGATT TGTTCAAAGG TGCATTGCTT CCCGGGACAT TaCTTGTGCT TTCTGCCATG 
TGCTTAGGGG TGGCAAAAGA TCGCACACAG GTCCGTCCAT CCTTCTCCTG GCAGTTGCTT 
GTCCATGCCG TAAGAGGAAG CGTATTTGAC CTTGCCCTGC CAGTGTGTAT TAGCCTGGGC 
TATTTTTCCG GTACGCTCAA CCTGCTGCAG TGCGCGTCGC TGACAACTCT CCTGGCTTTT 
GTATTAGGTA CGTGGGTGCG CAGGGATTTC ACCGTGAAGG AAgTTGCGCA ACCGCCCTTG 
AGAGTCTGCC TATCGTCGGT GGCATTTTAA TCATTGTCGC AGCAGCGAAG GGGCTGTCCT 
TCTACCTGGT GGATGCAAAC GTACCGGACA CCCTCATCGC GTTTCTGCAG CATGCAATTT 
CATCAAAGTA TGCGTTTTTG CTCCTTTTGA ATGTACTGTT GCTGGGTGTC GGGTGTATCA 
TGGATCTGTA TTCGGCGATC CTGGTAATTT CTCCCCTAGT GTTACCCCTT GCAGTGCATT 
TTGGGGTACA TCCGGTGCAC GCGAGCGTCG TTTTCCTGAT GAACCTTGAG CTAGGTGCGC 
TGACCCCGCC GATTGGAATG AACTTGTTCA TCGCGAGTTT TGCATTCGAA AAACCGATTG 
TGTATCTCAC GCGCGCTATT GCACCCTTCT TGCTAGCACA ACTGGGAGTG CTTCTTCTTA 
CAACTTACAT ACCATGGCTC AGCACTGCAT TCCTGTAGCA CCGCGTTCCG GCCACAAGTC 
TGAAAAAGTT GAAAAGAAAC GCCGCAGgca TGCTGCGATC CCCGTTTTAT GCGCCGGGTG 
CAGCCtCCCT GCGGGGATTC AATTGTCTGT ATACCTTTTC CGCCAGGCCG AATCCACCCT
GCGCGGCTAG CTGCGCACTA AAATGCTCAT AGAGGGCGTC TTCGTATAAC CTTCCTGAAA
AACTCCGTTC ACCTGCAAGC GTCTGCCCGC TCAACGTCTC GCGCATAGAC TGCACCATCA
CTCTCACAAA CAGCGTTTCA AGCTCCCGAG CTTGAGTGTA CAGCGCATCA TTCTTTTCTG 
CAGAACAGGC AGCACCTTGC TGCGCAsGGA aCAAGCGTGC CGCGAAAGAA CCACTACCTT 
CCATTTTCCC TGTCTTAGAC AGGGTAACGG AAGGAACAGA CTGCATCCCC AATGACAATA 
CACGGTGCAC GTTCACCTTT CAGTCTCCTA ACGCTTGAGC GCCACTGCTG TGCCGAGCAT 
GTTGTCACTC GTTTGAATTG CTTTTGAATT AAACTCATAC GCACGCTGGG CGACAATCAT 
GTTCACCATT TCACTTACTG TAGACACGTT TGACATTTCC AAAAACTTAT GCTCAACCTT 
TCCGAATCCT TCAAAACCCG GCCTTCCGGG AATTGGCTGG CCGGACGCAG gTGTTTGGGT 
AAACACATTC CCCCCCTCTG CTGCAAgcCC CGCATTGTTC GCGAAGnaTa CAGCTCAAGC 
TGTCCTACCT CAACCGGATC TCCCTGTTCC CCGACTCGCA CCGTAACGCG CCCATCCTTG 
CTAATAGCGA TACTGTGTTC TACGTAGTTT TCGGGAAAAA TAATCTCTGG AACGAGACGC 
AACCCGTTTG AGGTCACCAA TTGcCGctCC GCATCCACCT TGAACGAACC GTCGCGGGTA 
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TAAGCATAGG TTC CGTCATA TTGCAGTACG CGAAAAAACC CCTCACCCGC AATAGCCACA 
TCTCCGCTCA CACCCGTGTG CTGGAGCGAA CCTTGTTCGA AGAAGmGCTG CGTTGcAGCG
AGTTTCACCC CGTGCCCCAT CCGTACCCCA ACAGGGGTAA GTGTGTCCTC AGTTGycAGG 
CGTACCGCGG TGCGTATGGT CTGATACAAC AGGTCCTCGA ACTCCGCACG CTGTCTTTTA 
AAACCAGACG TATTCACATT CGCTAGATTA TTCGCTACCG TATCGATGTT TGCCTGCTGG 
CCGTTCATCC CCGTAGCAGC GGTCCACAAA tTCGTACCAT TCACACCTCC CTCTCACTcC
GCTACACGCT ATGCTCGTCT ATATCTTCTT TACATTCTAT AAAACATGCG GACAACTACT
TTGCACGCaC CACTTCGTTC CACAATCTGC CCATCATTCC ATCTTCTGCT TGAATAGTTT 
TTTGGTTCGC CTCATACGCG CGATTCACCT CAATCATACG AACCATTTCA TTGACCACGT 
TTACATTCGA CGCCTCAACA AAACCCTGCA CTGCAGCCGG ACGTTCAGGA CCTTCCGCAG 
CAATAGGGGC CCCTGAAACA GGAGTTTGCA TATACGTATC AGCACCCTTC TTTTGCAGGT 
AACGAACATT TTCAAACGTG ACAATTTTCA GCCGATCTAA AAAAAAACCG TCAACGTCTG 
GCCTATCTAT GGGACGTACA TAAATCTCCC CGTTTTGATT GATCGTATAG TATCGCTCCT 
GCAGAAAAAG TGGACCATTT TCTCCCAGTA CTGGATACCC ATTTTTAGTC ATAAGGTAAC 
CTTCTACACC GACTAGGAAA TTCCCATTCC GGGTGTACTC TTCTCCCTGT GGAGTCCTAA 
TCACAAAAAA ACCCATCCCC TCAAGCGCAA TATCCGAAGG ACTTTGCGTT TGTTTAAGCG 
AACCCTGCTC AAATTCAGTG AACAGTTCAT TCACCTCAAC ACCGAGGCCT AACTTTCCAA 
CTATAGGAGA AACGTCCGAA GAACCGAAAG GGTTCTTCAC CACACCATcG TCGTTTACAC
GACGCAATAG GAGCTCTGGA AAACTCTTGT GAACTGCTAC ATCTCGCTTG TAGCTTGTTG
TGTCTACATT CGCTAGGTTT TGCGCAATAG CATCCAGCCT GCGCTGcTGC GCGCTCATGC 
CACTGGCTGC GGTATACCAC CCTCGGATCA TACGCCCCTC CGCTCCCCCT GGTATCGGGA
GATAGAAAAG GGGAATCAAG AAAATTCTTT TGTCAGTGTG ACTACTTTTT TGATATTCAC 
TGAGCAAGTG CAATAATAGG ATCGAGTCTT GAGGCCTGCA GCGCTGGTTT TAATCCAAAG 
AAAATCCCCG CTCCCAATGA CATAAAGAAG GCTGTACGCA TACCCGCAGT GCTCAGACTG 
AAAACAACTG TTATCCCCTC TGGAGAAAAC ACGGAAAAGA GCCCATAACT GAGCACCATC 
CCAAGAATAA GGCCACACAC GCACCCCGCC AGGnTTAAAA GCACCGCCTC GAGCAAAAAC 
TGCTGAACTA TTGTTGCGCA CGTCGCACCG ACGGCCTTGC GGAGACCGAT TTCTCTGCGA 
CGCTCGGTTA CGGTTACTAC CATAATGTTC ATGATATTTA TGCCACCGAC AATCAGCGAG 
ACTGCAGCCA CAACCGACAG CACTACACTC ACCATACTCA GAACGCTGCG AAAACTTTTT 



29580 

29640 

29700 

29760 

29820 

29880 

29940 

30000 

30060 

30120 

30180 

30240 

30300 

30360 

30420 

30480 

30540 

30600 

30660 

30720 

30780 

30840 

30900 

30960 

31020 

31080 

31140 

31200 

31260 






WO 98/59034 






ATTTCCCCCG CACCGGACCA AAGGCTCACA GAACCCGATT 
AACTCCCGGA TACGTTTTTC CGCTGCGGCA ATAACCTGCA
ACCGCGTCTG CCACACGACC TGCACCCATT TCTAGAGAAA 
ACCCGATACG AAGGAATGCC ACTAATCAAA CTCCCCTTTT 
TCAAATGGGA AAGACAGTGC ACGTTCTGCA CCCGACGCCC 
ATACGTTTAC CCAATGCATT CCCTTCAGGA AATAATTCTT 
ACCGCACAGT GACGATGGGT CTTAAAGTCC GCTGGAGAAA 
TTAAAATCTT TTAACTCCAG CCACCGCGGC TCTACTCCCG 
CCTGTGTGAG GAGAAGAAAT AAGTGCCTTG AGGGAAGAAT 
TCCTCGCTAC TTTGTACAAG TCGCGTCCGA TACGACTCAG 
TTCTTCACAT AATCCCACTC TGGCCTGACT CGAATGAGTC 
CTCTGGGCAA GACTCGCGTA GAGAGACTCG CCGATCGAGG 
ACCCCTACCG CAATACCGAG GAACGAAAGG GTCGTCCgCA 
AACAGGGTGT TcACGATATC TTCAAGCATA TCCCTCTTTG 
GCGCGCCTCT CCCCTCCTAC AGAAACCTGA CTTCACGCAC 
GCACGCGCAC ACACATCGCC TAGCGGTCTT CCAAAGACTG 
CTTGGAATGC TGGATACGAC CCAAAACACA CCCCAATAAC 
TAAACATGCC GAcTACGCTA GGAGAAAACG TCATTTGAAA
CGATAACAAT CACGCTCAGT AAAAGGCCAA GCACAACGCC 
TTAACGTGGC CGATTCTACC AAAAACTGAT GAAGCACGTG 
CCTTCCGCAA CCCAATCTCC TGGCGTCGTT CGGCGACCGT 
TACCGATGCC ACCGACAATC AGTGAAATAG CTGCGATGCC 
CCCGGAGAAA ACTCCGCATC TGTTCAACGA TAAGATGGAG 
TCTCGTTACC GGTCAGATTG GTTAGCACTG ACTTAACTTT 
ATCCAGAATC GTATACTTTC AAATCCATTG CATCGGCAAT 
TCAGCmmA 

(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8642 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double



TGTTAGAGAA 
CATCGCGTAC 
TAAACTCACG 
CCTGCAAGAC 
GGGAAAGTAT 
GCGCAAGCAA 
AGAACGTCCC 
TAATGTTCCG 
TGTAAAACAC 
TCGGCTGAAA 
GGCGCTCGCC 
TAATTACCAC 
GCACACGCTG 
CAGAGCcCGT
ATGGCAACGC 
GATTGGGTCA 
TACTGACCCT 
ATCAAACGCA 
GCACAAACCA 
CATGCGCGAA 
CACCAGCATG 
CGTCAAGACC 
GGAAGAAACT 
GTCCTCAACA 
ACGCCGAGAG 



AAAGTCCGAA 

GCACACCTCC 

GGGGACAAAA 

GCCTACGATT 

GGTCACAGTC 

ACCGCCAATC 

ATACTCAAGC 

TTCCTTTCCC 

TCCCTCTATA 

CATGATTTCG 

CTCGCCAACA 

TACCGACGCA 

CCTGAAAtAC 

yGCTCTACGC 

CACAGGACAG 

AGACCCGCGG

GCAAAGGCAA 

TTCAATCCGG 

CCTACGAACG 

GCACCCAGTG 

ATGTTCATAA 

ATATTCATTG 

TCAAAGGCCT 

TGCGCGATCG 

AACTCTCTTG 
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(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 



CGGATACCCC GCAGTGGGCG CCGTTGCCGC CCCCGCTCGC TTGGACCGTA CGACAgTGGA 
CGTCCGTGCC GTCTAACACG TAGAAGCACC CCTTTCCATC GCTCGTGCAG CCGAGGATGG 
GGGACGTATA GCGGTAAACG TGCCGTCCTT TGTGTAGGTG CAGCTCGTCC CCGAACTATT 
GCTTTTGGTG TACACGGCCT TGGTGGTTAC CACGTACCCG GTTCCGGAGC CCAGGACGTT 
TTCATCACTT CCGTTTTTAA TACCACCATC GGAAGAAGAG CCGCTCCCTC CGCCTCCACC 
GCCTGAGGAC CCAGAGCTAA GTTCACAGGG CATTTGCAAA AAGGGATTGT CTGACCCGCC 
AATGCGGATT GCCCCATTTG TTTTGCCTAA AACGGTTGAA GGCGTCGTGG TCCCTCCCGT 
TCTTCCGGCG CCGTTGCTGG TGTACGTGTA CACACCTTCC CCCGAAACAC AGGCGTACAC 
GCACGCGCCt TCGAAACAAT GCTGGTAATC TTTTTGCCTG GCAAAAAATT TACTGCCGTC 
CAcTTCCCCT CAGACTTGCT CGCGTCCTTT TCCCACAGCT GACCGGCGCA GGCGTACAGC 
TTGTTATTGC ACTTTACCAA ACCCGTCACT ACCCcGCGGA TGCTCGGTAT TTTTAACGGT 
AcTTCTGACT GGATGGAGGC AAAAATGCCa GAAAAATCGC AGCCGGTTAA GAGACTTGCG
CTTAAAAATA ACACCGGCGG ACAGACTATG CGGCGCACTA CGCGGCTTGC GCCGTCTGTG 
TGCGCGCGGC ACGCCTTcGC CGCGgCACAG CCTcCACCTG CTTGGACTAC AGCTCTTTCC 
AATCCTTTCC ATTGTTCCCC CCTATTACTG CCCTCATGCA GACGTGGCAC GTCCCCGCTC
CTATCACAGG CACAGAGACC CCCCTACCAA AAACAAAGCA CACTGCAGCC CCCCGAGGCC 
GCTATCCGCG CCACCGGGGG GGGGGTATCG GAGGTTCGCG CGCTTCCtCC TCCGGTTGTA 
CCGTTTCCTG AGGAGAGACG GCAGATGCTG AAAAGGGTTC CCCTTCCCCA CTTTCTAACC 
AGAGACGGTT AACCGCAAAC ACCACACACA ACAGCAACGT CAAATcACGA yTGAGCGGCA 
TCGTGCCGCG CTCAAtCGCG TsGAGyTCTT CAAcACCAwT CCCAAGTTCC CGCGCAAAAG 
CCGTCTTACT TAAGTtTTGC TCACAGCGAA CGCGCTcAAC ACGCACACCC ATGCcTTCCA 
TCTTCCCAGT ATACTCTAAA ACACCGCCGC GCATAGCCCC GCAgCCGCAG CTTGCAACAC 
tACGGGTGCA GGGCAAAAgG AGTwATGGGA CCGGGCTTTT TCTCTCCCGC CGGCATCTGG 
GCAAGGATAC GAGGCACACC GTCCGCCTTG CGCGCATCAC CGAGTGCATG ATACGTCCTG 
CTTAAGTTGT ACAAGGCACT CCCCGACCGC GGCAGCAAGC TCAGCGCCGT CTCAAACGCA 
CGTGCGCCTG ATCGTACTTT TGcTCGGTGT GCAATAGCAG TCCGTAATTG TTCCAGGTAG 
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CACCGTTCTT CGTTTCCAAG CGGATGGCGT GCTCGTACGC ATTGCGCGAA cCTCAGTGTC 
CCCCATATCG TGGAGCACCA CCCCAAGCAT GTTCCACACC GCCGCGTGAT GAGGGGTGCA 
CCGCAACGCA TGATACAGCG CACCTGCTGA ATCAACCGTA CGCTTGAGCG CATAATAGCC 
AAGCGCCAGG TTCAACCACA GAAGCCCGTT GAAAGGACTG AGCCCCAATC CCTTGCGTAA 
ACAGGCAACG GTTTCCTCAT ACCACCCCTT GCGCGCGCAG GAAAGCGCAA GACAGTTCAA 
AGACTCGGCC GTCTCCTGAA CCCCACGGGg cTGCCGCCCG TGCTCAGGGC ATTCAAACTC 
TTCAAAAAAC TGATTCACGA GTCGTCCTCC TCCCCTTCTC GAGCGGGTAT GCGCGCGCAg 
TATACCAGCA GCACCCGCAA AATACGAGCA CCGCGCACAC CCCACCTTCA GTCACCCTCC 
CCCCAGCCAG TCAAAACCAC GTCCCTGcTA TCGCGCACAC CCTGCGCACT GCAAGAGcTG 
CGCAACTTCC TGCAGCCTCA TGGTGCGAAA TGCTCCGGGC CTCAGCGGTC CTAGACGCAC 
CCTGCCAATG CGCACCCGCA CTAAGCGCAC GACATCCTGT CCCCACGCCT CGAATACCAC
ACGGATCTCG CGCTTTTTCC CCTCAACCAG TACAAGCTGT ACACACTGCG CTGCAAGATG 
CCGCGCGCGC ACGCACCGAT ACCGGcACCC TTCCACCCAC ACCCCACGCA CAAAAGAGCT 
CAACAGCGCT GCAGGGACTG GCTCACGCGT TTCTACAATG TACTCTTTCT CTATTCCsGA
ACGCGGATGG CCAAGAGCmT GCGCAAACGA ACCATCATTT GTGAACAGCA GCGCGCCTTC 
AGACCGCACG TCCAGCCGGC CGATGTGATA TAGGCGCTCC TGATACGCAG cTGTACTAAA
TCGATTGCAC GCGCGTATTC CTGTTTGGAC GGGCCTGCCC GCACCTGCGT GTGTGCATAC 
CCGGCAGGAA ACTGCGGCGC GAGGGAACAG ATATATCCAA CCGGCTTATA CAGGAGCACG 
TAGCGCTGAA CTCGTTCAAG CTGCACGACG GTGCCGTCCA CACACACCAC ATTCTGCGCA 
CAAACGGTCC GTCCCTGTGT CGTAACCGTC TGACCATCAA CGGTCACACG CCCTGAAGCA 
ATCAAGGCCT CACAGGCACG CCGGGAGGCA CAGCCACTCC TGGCTAAATA GACCTGTAAC 
CGGAGGCGAA AAAACGGCTG CAGGCGGCAC ACCCTCCCCT GCACCCTCTG TTCACCCGGC 
TTAACGGGTA AGCTCAAAGC GCCGCTGCTC TTCTTCATCA AGTTTGGGCA AGTCTGCAAT 
GCTGCGCAAC CGGAACGCAG TCAAAAACTC CTCAGTCGTG CCATACTGCG CCGGCTTGCC 
GGGTATGTCC TTTTTCCCCA CCTCGCAAAT CAGACGGCGC TCACTCAAAA GGCGGATCAT 
TGTATCTGCA CCTAraCCCTC GGATTGCCTC TATTTCAGCA CGCGTCACCG GCTGCGCATA 
GGCCACAATA GACAGCGTTT CCATTGCCGC GCGCGAAAGG CGCCCTTCGC TCCGCTTCCC 
ATAGAGGGTT GCAAGACGCT CCCGTACGGT CGCCGCAGAG ACwACGCCAC ACCCTGCTCG 
TTGCAGTGAA GCTCCAGTCC ACCACCACCA CGCGCGCCAG AAGCGAGAGC TTCACCCAAA 
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CGCGCAACAC ACTCACCCAC TGCCTGTTCG CTCAAACCGA GCTTTCGTGC AAGACACGCA 33 60 

TAACTGAGCC GCACGCCTTC GACAAACAAA ATAGCCTCCA GwAGCGCAAG GTCCGGTGCG 3420 

GGTGCTCCGT GTAGCGTGCA AGGCTCTGCT TGGTCCATCC TGCCgATGtA CGCTCTTTCC 3480 

tCCCTCCTaC AAGCACCCAA TGCAATCTAA CGACAGGGAA GACGGCACCC GCcTGTCTGA 3540 

CTTCCGTTAC GGATTAAAAT GACCGATCTC CGGCGCACGC ACCCGCACCA ACGCAAGCAT 3600 

CTGyTGGGGa TAcTGCGCTa CAAACTCTTC AAGAGTGCTC AAACGCGCCG CTTCAAAAAC 3660 

CTCTACCTGA AATCTCCCTG ACCGATCGAA ATACGGCAAA AACACTGCCG CACGCTGGTA 3720 

TTCACGCAAA CGCGGTtACG CCACGCTCCT CATTGAGCAC GCCAAGATAG AAgTGGCCTG 3780 

GTTCCATAAC TGCAAGATAG TAAAAGAGCG CACGCACATA CCCGGTCTGG TATCCTGCAT 3840 

GCGGCAAGTA CCCTAGGTAC CGCGACGGCC CAGCCGCAGT TTCACCCTGC GTCACACGGG 3900 

GAGGAACAGC ACCCGCAGcG TCGACGGGGG CGCAATACCC GGATGCCGCG CCTCCTGCTT 3960 

TATCCTCCCT TTGGGGACAC CCTCCACTGC AAAGGGTTCT ACCGTTACGT CCACACCTCC 4020 

CTGTGCGGGG TACACCGTGC GCCGCACATT AAGACTCAAC GCTGCCGCTG CCAGGTTGCG 4080 

TATCCAGTCA ATGCCAAAAA AGGGTkCGCG TGCGCGCTCA TGCGCCGCGT TAAAATGAGT 4140 

ACGCGGcATA GTCGTCTGcT TTTTTAGCGC AGACAGGTAC ACTCCCTGaC CGGCAACCGG 4200 

CGCAATGATC CCGTCTACCA CCCACTTCGC AAACCCAAGA GATCCCACTC CCCCGATAAT 4260 

CTCCCGCGGC ACTTGGTCCA CACGCAGGGC ACGGATAATC TCTTGGTCAC TTTGACTGCT 4320 

CCCGTCAGAA ATGTGGACCG GGCkTCCCCA TTCGTTAAAA CATCCATCCT CAAGTGGAGC 4380 

GAGCCGGTAA CTCCTGCGCT TAATGATACT CGCCATTGAC TCTACTGCGC TGTAcGTGCG 4440 

CGGCGGATCA AAAATACGCC ACGGCACCGC ATGcTCcGTA AGCGCCCGCA GCTGCAAAAG 4500 

CGAGTaGGTA TACAACTGTT CAAAGGACAA CCCAAGCgCc sTGyTTGCGC a cGTGCACGT 4560 

TAAAAAGACA CACATCaAGA TGCGTCTTTC CcTGCATTTC TACTGCAGTG GGCGCAGAAA 4620 

GTTGTACATA CAGCGACGCA CTTTCGCGCG GGTATACCCG TnATGaCACC GGCGCACCGG 4680 
TGTGCATGTG CCGATACAGC ACCCATGTGC CTTGCGCCAC AGCTTGACTA GGCGCAgCGT 4740 

CTGTCACAGG TGATTGCGGG GTATCTGCGC CTGTCCTTCC CGCCCGACTT GTTCCTGTGG 4800 
GCGCCTCGAG CGCGAGGTGC AGCCGAGGCG CCGCTGCCAC TGGCGCAGAA TGCACACCAT 4860 
CCCTTGGGGC ACGATCAGCT AAAAGAGGCG TAACGCGCAC TGCAAGTGGT CGGTTGCGGC 4920 
AATCACGTAC GCCTCAACGC GAAAGcTATT GCCCGTTTCG TCCGTGTAAt ACGAGgTGCA 4980
CGCArCGCAA CACGAGACGG CTCATCCAAA AACCAATCtG CGCGATAACA CGnCnCACAT 5040 
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GGTGCGAGTC GGGGATGCGC TCAAAGGGCA ACAGCGGCAA ACGCTCCCCT GTCTCTGAAA 5100 

CGCCGGAGTC AACCATGATT TCTATGGaGT ATTTCTTGCT CAGCTCCTCA TCCACCTCAC 5160 

TCGCGCGCAg CAGCCCCTGC AGCATTATCA CCAAACTTAT GCGTAACACA CCgCGCGCAC 5220 

GCgCAsCCAG ACAGCTCATA CGGCCCCATA TCGGCGCACG CGTGGGCGGG AAAAAGCAAG 5280 

ATCTAACCGT TCCCATTGCC CACAAACCCc TTGGCAGGTG CATCCtGCTG TGCTAgGGTG 5340 

CGCCCGCCtT GcACCCTGcA GCTGTCCTGT GTTAAGGAAG TCAAGATACC ATGAACACGC 5400

GCCTCGCTcT TGTCCTGtGT GCGGTGGGAT CTGGcGTGCT GTCTTTCTCC TGTGCACGCA 5460

CTGcCGAACC GACCCCCGCA GCTTCCACAC ACGTCCCTGT CACCACCGCC GGCGCACTCA 5520

GTGTCACACC GCCTTCGAGT ACTGACCGCT GGTACCAGTT CTCACGCACG GACGGACGAG 5580 

TGCACCTGCG CGCGTGCCCC GCGCCGTCTC AGCCTTCTGC ACCTGAACAC TTTGTACCCT 5640

GGACTGAGGC TGTACgcCTG TCGGCAGTGG ATGCACAGCA AGAACTCTTG CTCATCAATC 5700 

GCGCCGGAGT ACTCCCAGCC ACGCAtTAGC CCGCATGCAG ACCGCACCGG TTCCACGCAA 5760 

AGCACCCTCC ACACCCGCTG CGGAGACGAC ATCGCTCACT CTGACGCCCC CCGCACTCTT 5820 

AGCCACACAG AGCGCTGAGG GCTTTTACTC AGAGCCAATC CCCAACAGTT CCCCCCACCC 5880 

TTGCCAGGGT ACCGGTGCAG TGTTTGTTCG TCTCTACACC GATCCCCTTT TTACCACTTC 5940

ACCACAAGAC TCTGCAGCTC CTTTTCTGGT GCGTTACGAT GTGCGCACCG CTCGCTGGAC 6000 

TTCTGTCGcA TACACGCGmG CTCTGGGaTT GCCCCGGAAC GCCCAATGCA cCGCCCTcAC 6060 

CCATACTCGc GGCACCTGGT ACGCTTCCTT TAAGTCCTCA GAAGCAGAAC GCGTTTCCTT 6120 

TGCATACTTC TCCTTTCCCT CCCTTTCTTC CCTTGAGAAT TTGGGACCTA CCCAGCGCAG 6180 

GGAGCATCCA ATAGGGGGAA AAGTGCAGGT GCCCCGTCCT ATCTCCGCGG CCGCCTTTCG 6240 

TGCCGCGTGC ACTCCCCAGC GCCTGcACCT TCCGACGGcG TCCAcTTCCT CTGaTCATCA 6300 

CAGCGACCTG cACGAATTGC TTGTGCATCG CTTGCTTGCG CGCGTACCTC TCTCTCCCCT 6360 

GTACCTTTCT GCGCGAAcCC GTGTTGGGCA AGCGATCGCT CCTTTTTAAA GACTGCCCAC 6420 

CGCACTGCTG ATGAGCGTGC ACACCACGCG AACGnGCTCA TTTTTCATCC ACCGCGCGCA 6480 

CGGCTTTCTG CCGCACTCTT AACAGATTCA GGCCACCTCT ACTTTGTACG AGAAGACGGC 6540 

TCTGAAGGCC ACGCACGACT GTCCGCCTTA CCCCCGCAAT TCGTGTACAC TTCCTTCACT 6600 

CTCTCCGGCC CCTCCCTCAT TGCAGGGTGG GAAGAACAAG ACTTCTTTCA GGTAGGTAGT 6660 

ACAGGTCTTT TATGCACCGA GGTAGAATCC CTTACAGGAA CATAAACGCT CCCGGGAGCC 6720 

TGCTCTATGT CCTACAAATT CTGTAAGGAC CCCCACGTCA GGTGCCGACA CTACGTTGGC 6780 
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AGGGGTGGCG GATGAAGCTA AAGCGCTCAT TAATAGTCGG GGGAGGCCTG TTGCTTTGCT 
GTGCGCACGG ATATGCGCAG GCGAAGGGAG CACGGGCGTC TGTGCATATT GCGTACCATA 
ATCGCACGAT TTACTTCCCC GGCACCCACG AATCTGAACC CATTTGGGTG AAAGTTTCAC 
TTACAAATAC GGGAAAGGAC ACGTTGCGCT TCAAACTGGC GGACGACCGT ACCTTTAGTG 
TTGATTTTTC TATACGGACG ATGAAGAACC GCGCGCTTGC gCACACGGAC GAATGGATAC 
GCAAGCGGAG CACTCATCGT CCTGTGTATT TTAGGGAGAT CAGCCTTGAG CCGGGGGAAA 
GCTACTCTTT TGTGGAAAAT GTGAAGCATT ACCTTGATGT GCAGTCGGCA GGGTTGTACT 
TTCTAACCCT TCTCTTCTAC CCCGAACTGA AAAGGGAGCG CACCGGTGAC GAGGACCATC 
TGGCATCTAA TACGCTAACT CTTGAGGTAC AGCCTGCCCC TGCTGCGGCG GCGCTCGGCG 
CGTTGCCGGT TTCTCCCCCC GTGGGTGAAG TTCTGCAACC GCAACGTCTT TCCCCGGATA 
GGGTTATCGA GTACGTGCTG AATGCACGGC AAAAATCTCA CTGGGAACGC TTTTTTCTGT 
ATCTTGACTT GGCAAAAATG CTTTCTCGGG ATGCGGGGCG CAgTCGCCGC TTTAACGcAG 
AGTCTGAGGC AGGACGCTAC AACATGATTG ATACCTATAA GCACGAgTAC GCCAGGAGCG 
TGTGGATAAG GATATTGCTG CCATACCCGT TGAATTCCGT ATTGAAAAAA CCGTGTATAC 
TGCTACGGAC GCGGAGGTTC GCGTGCTTGA GTGGTTTGAG TACCGGGATT TCCGGGAAAA
GAAGCGCTTT ACCTATCACC TGTCCTCCCG CGACGGCATC TGGTATGTAC ACGATTACGT 
AGTTGAGAAT TTGGGAACAG AATGATGAAG GCACTTTTAG TCGCAGATGA TCCCGTTTCG 
GTGAATCTGG TATTTGAAAA CCACACGCAG TGCGGTTATG AGGTGATCCA tACCGTTCTG 
CGCTGAAAGC CTTGGACAAT ATGGAAGAGA TTCAGCCACA GCTGCTCTTC ATCAACGCCA 
GCGACTTTCC GCGACACTGG AGGGTCCTCA CTCAGTTCTT TAAACATCAG TCGGTGTGGG 
GAgCGCGCGT AATCCTGCTA GTGAACACTC CGTCCTCCTC TCTCAGCGCG CGGCAGGTGG 
CGCAGGCAGG GGTACACGCG CTTATCGATT ACACTCTATC TCCGGAGGAG GGACGAAAGG 
CTTTATGCGG CGTGCTCACG CCCTCCGCGT GCGCAGGCTC TGTCGACGTG GGTCATGCGC 
ACACCTGCCA GGCAGATTTC GTGTTCACAA ACCCCTGTAG CGGCTCTATT GTCACCGGGA 
CCGTACGGGA AGTGAGCGAA GAGGGCGTAG ACTTCATCCC CGACTTTCCC GCGAGCGTCA 
ACAATCTGCA AGAGCAGGAT GTACTCGAGC ACTGTGCGCT AAAGGTAGCA CACGACATTC 
TCGGTGTTCG CTGCTCGTTC CATTCATCGG ACGGGCGCAT CCTGCTACGT TTTATAGATC 
CCGATGCGTC ACTGGTACAT GCAGTACGCA GCGTCACAGG TACCACATAG CAACGGTACC 
CACACACACC CCAAGCAAGC AAATGGCTGC GTAGACCCAG GTGGGCAAGG CCTCTTCGGC 



6840 

6900 

6960 

7020 

7080 

7140 

7200 

7260 

7320 

7380 

7440 

7500 

7560 

7620 

7680 

7740 

7800 

7860 

7920 

7980 

8040 

8100 

8160 

8220 

8280 

8340 

8400 

8460 

8520 







ACGGCGGGGG GCTCCtTCGG TGCCGGGGGG CTGCCCTTTC TCCGGTTCTC GTTCCCGAAC 
GCATACCCAC AGGAAGGnCA GCCCTTTTCG AAGTCTCTAG CGTTTCCTGC GTGTTTGCAC 
AA 

(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6761 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 
TTCTCCATGT TATGAGATTG GACTCCTCGT TGGAAACGTT CCTTTCAGAA nAGATAGCGT 
TAACCGTGGA TTCGTtATCA ATATTGACGC TCTCTCTCGA TGAACAAAGA CAAGTTCTGC
AAGCGGTCCG CACCGTTTTT GTTCCCACAC GACAGGAGGG GTATATTCCC GTGTTGCTAA 
CGACGGATAC GATTCGTAGC GCAATGTGGA ATTTGTTTTT TTCAGATCGT ATTGAAATCG 
CAGTTATGTC CTATAAAGAA GTTTCTACCG ATATGCGTAT TGAAACAGTG GGAGTAGTAA 
GGATAGAAGA GAGTGATGTG GATGCTTTTG TGAGAAAGCA GTAGTCTTCG GGCACAAGAT 
GGGTGGAGGG TTTGATAGGT GGAGTTATTA GTAGAAGTTG CCCCAACGAA GGAAAAAGCG 
ATAGAGAAAA TTCGGAAAAA GTATGGAGAT cGAGTTAATA TCCTGCGCAC GCAGAGGAAT 
AATAGGAGTT TCTTTTTTGG TCTCATAGAA CGAGTCTCGG TAGAGATTTT TTTTTCTGTC 
AATAGTGGAT CGCAATCATC AGTACACGAG ATACCCTCAG TGCAATCgCG TACGCtGTGT
CCGCTGCTCG GGTAGAGGAT ACTGAAGCAG AAAAAATAAA GATACTTGAA TCTGCGCAcG 
TATTAATGCG AaGATAGCAC AGCAGGTAGA GCCCTTAATT TCAGCGGCAA AAGAGAAGAA 
AACTGAAAAA GTGCCAACTT CCCCTGAAGC GGTGCATGCG CTCACTCAAA CGCTAGAGGG 
TATGATCCAG AAGATcACGA ATAGTGCGCC GGTGGTGATA GCACAGGAGT TGCAGTCGAT 
TCAAAGAATC GAACTTCTTT TAGAGGAAAA TGATTTTAGT TTTTCATTTA TAAGAAAAAG 
TATTGCTCGT CTAAAGGACG AACTCAGTTA TCATGATTTA GAGTCTTTCG AAAAAGTTGA 
ATCAACAGTC CTGCGATGGA TTATAGAATC AGTCCACATT CAAGTTCCCC CTATTTGTAC 
CGGAACAAGA AACATTGTAT TAGTAGGACC GACTGGTGTG GGAAAAACCA CTACCCTCGC 
AAAGCTTGCC GCGTTCTATT TTGTTACAGA ACCGAAGCGA ACTGGTATTC AGCCACGAGT 
AAAAATCATT ACAACGGACA ATTTTCGTAT TGGTGCAGCG TTTCAAATGG AACGTTATTG 



8580 
8640 
8642 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
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960 
1020 
1080 
1140 
1200 
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cGAGCTTATG 


GGACTCGATC 


TGTGTGTAGT 


GCAAGCACCG 


GTTGAGTTTT 


TGACGTACAT 


1260 


GACACTGTAT 


CAGCAGGAGA 


CCGATGTGGT 


CTTTGTGGAC 


ACGGAAGGgA 


GGAGTCCGGT 


1320 


TGATGGACAG 


AATATAGAGC 


GGATGGTGGA 


ATACTTtCGT 


GCGGTAAAAA 


ATTTtGAACt 


1380 


GGAAGTGTAC 


CTTACCATtG 


ACGCtGGATC


GAAGGCGAAC 


GACTTGCGCG 


AGGTGTTTAA 


1440 


GCAATATGCG CTTTTgAGTA TCGTGCGCTG ATAGTAACCA AACTtGATGA 


AACAACAAGT 


1500 


ATTGGaAACC 


TCATTAGTGC 


GTTGAGTGAG 


GCAAGGACTC 


CTATCACCTA 


TATTACGACA 


1560 


GGACAAACGG 


TTCCAAGCAA 


TTTAGAAAAG 


GCGTCAGTAA 


ATTTACTACT 


TTCTAAATTA 


1620 


AAAGGTTTTA 


AACTTCTTGC 


TGAGGAGATG 


GGCAACGACT 


ATGGTGATTA 


CGGTAGCAAA 


1680 


GAGAGATAAG 


CGCATAGCAG 


ACCAGGCAGA 


AGAGCTGAGG 


GATTTGATGC 


AGGAAAAAAA 


1740 


TGCGCGGGAG 


CtGTTGAACG 


TCATCAGCAT 


AGAACGCGTG 


TTGTCGTGGT 


AACCAGTGGA 


1800 


AAAGGCGGGG 


TGGGAAAGAC 


GAATATTGCA 


ACGAATATGG 


CAATTGCTTA 


CGGGTACATG 


1860 


GGGAAAAAGG 


TGGTACTCAT 


AGATGCAGAT 


CTTGGACTTG 


CAAATGTGAA 


CGTGATAATG 


1920 


AACGTTGTTC 


CCCAGTATAA 


TTTGTACCAT 


GTGATCAAAA 


AGCAGAAGAA AATGTCTGAT 


1980 


ATCATCATCG 


ATACTAATTT 


TGGTATCAAG 


CTCATCGCTG 


GTGCATCAGG 


GTTTTCCAAG 


2040 


ATTGCaAATT 


TAAACGAAGA 


AGAGCGTGCA 


GCTTTTATCC 


AAGAGTTATA 


TTCTTTATCG 


2100 


GAGACGGATA 


TCATTATTAT 


CGATACAAGC 


GCTGGTGTTT 


CGAAGAATGT 


CGTAAGCTTT 


2160 


GTTGCATCTG 


CCGATGATGT 


CATTGTTGTG 


ACCACTGCCG 


AACCTACGGC 


AATCACCGAT 


2220 


GCGTATGGAA 


TGATAAAGAT 


CATTGCAACT GAGGTTGATA ATCgGGATAT 


GAACTTGAAG 


2280 


ATGATAGTAA ATAGAGTGAA TTCTGCCgCA GAAGGAAGAA 


GGATCTCTGA 


ACGCATGATA 


2340 


CAAATTGCAG 


CTCAGTTTTT 


AAATCTGAAG 


TTAGATTATC 


TGGGCTTCAT 


TTATGACGAC 


2400 


ACcTCGGTAG 


GTGCGAGCGT 


TCTCAGACAG 




TAATCCACGA 


GCCTCGGGGG 


2460 


AAGGCCTCCG 


TGTGCTTGCG 


CCATATCGTG 


GCAAAGCTGG 


AAAAAACAGA 


GATCGCCGAG 


2520 


ACAGGCGGGC TTTCAGGTTT


TATTCGCAGG 


ATATTTGGAA 


GGGAATGGGA 


ATAAGGCTCC 


2580 


CCCTTTCCCT 


ACCGACTAAG 


ATTGATGAGA 


AGTTGGACCT 


CCCCCAGTGG 


CTTGCCGGTC 


2640 


TTTTCCGCAA 


TGAACTCAGG 


GGCAAGTCCC 


TTCTCAGATA 


GGGCGATGAT 


GGCGTCTTTA 


2700 


AGCAAAGGGG 


ACTCAGCTCT 


CAGCCCGTTG 


TCCCGAATGA 


TTTTCTCGTT 


ATAAACCTCA 


2760 


ATGGTGCCAC 


GCTTGGTAGG 


GGTGGGTTTC 


GCTGCACTCG 


ATCCTGCGCG 


TGAAGGAGCG 


2820 


CTCCGGTGCT 


CAGGTCGCGC 


GCAAACCTCC 


TCCCGAGTCT 


CCTGCCCCGC 


ACCCGATGCC 


2880 


CTGAACAGGG 


TGCGAGCCGT 


GTGCTCCTTA 


AGTATTTCCT 


CGTTAAGAAG 


AGTGAGTTTT 


2940 
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CTGTCGATCG TGCGGACGAC C TG G TT AC AC TCTCCTATCT TCTTCTCGAG CATTTGCACG 3000 

GCAATGTCCG CTTCATACCG TATATCTCGA ATCATCTTTA TCACTTCCTG TTGCATCGTT 3060 

TTCGCATAGG CGTCAGGAGA AAAACTCGTA CGCACCTTCA CGTAGAAGTA CACAAGCAGA 3120 

GCAACGGCCA CGAAAGAGAA CGTAATCGAC ATCACCAGCA TAGCGTATTC CTCATTTTCG 3180 

CTGCGTATAA GGAAAACTTA GTACGCATAA CCTGAAGGGG CTCTTTTATA ATCTTCATCC 3240

TGTTCGCCCG TGTATTGGCC ATAGCGAATG AGGCCATTGC GTAAATAAAA ACTTATGGCC 3300 

GTGCTGTCCG CGCGCACCTC TTCGCTCACG TCTCGAATGG TGACTTTTAC ACCAGAATAC 3360 

ACCTGTCCTG AGGCAGAGAT TTTCCCCTCG ATTGGAGAAG CATCCAAGGA TGCCTGAATC 3420 

TCTTCGAgCT CCGCGCGCGA CTGCTGCACT AGCTGCTCGA GTGAGATCTT TTCCTCATGC 3480 

AGACTAGTCT CAAGCGCCTC CTTATCTGGG GGAAGTTCCT TACGCGCTCT CTTTAAATTC 3540

TCGAGGGATT GGAGGTTCAA AGACAGATCG GAGAGTTTTC GTTCATGTGC GTGCAACTCT 3600 

TCCTGCAACA TGCTGAGGCG ACGTACACGG TGCGGATCAA AGCCGACGCT GATTTGCGTG 3660

TCGTTGCCGC CTGATTGGCT GCCTAGGTTG CGCGCGTAGA CAGCCTCTGC CGCTGCAACG 3720 

TTACTTCCGA TGATGTCGGC ACGCCGCCCA CGACAAATGA TTTTCCGGTT AGCAATGACG 3780 

TGCGAGTTCA TAATTCCGTC AGAAACAATG ACAAGATCTC CTGCTTCAAC TGAGGCGCAA 3840

TTCTGGATGA ATTTAGCCCA CAGAGATTTG CCTGCACGAA CGCATCCTTC CTCCTTTCCC 3900 

ACAATAcCTT GTCCGACTAG AATGTCCCCT TCTGCATCAA GCAAGGCCTT TCCCACCGTT 3960

CCGCGCACTT CGATGTTGCC TGAGGCCTTA ATCTCGTAGT TATCCTCAAC GTTTCCGTGT 4020 

ACCAACACGG TACCAAGGAA CATAATGTTC CCTGTTTTTA CAGAGACGTT TCCTTCTACC 4080

ACATAGATGG GTTCTACGTT GATGCCCCTT cGGGAAAGCA GGGCTTGTCC GTCAGTTTCT 4140

GCAATGACCG TAAGGCCGTC aCGCGCAAGc GCTGTGTTTC TTCCCAGAGG AATGGACACA 4200 

TCCTTTCCCG ACTGTGcCGG AAGATACGTG CCCGTGACGG TTTTGCCAGG AGTACCCCGC 4260 

TGTGcAGGCa GCTTCTGCGC AAGCGGCTGT CCTTTGACCA CGTTATGAAT GAGGTTTAAC 4320

TCCTTAAAGT TAATCTTCCC CGTCTTGAGC TCTTGCAAGT GCACACGGgT GCGGTCAGTT 4380 

TCGAAGTGAT AAGAAATCCT CGCATTTTCA CCGTCCTTTG GAGGGGTGCC CCGTGCAACG 4440 

AGGTAGGGTT CATGGTAAAC CGGACAGTCT TGGAACGAAT TGACGCGTTC CATGTCGATG 4500 

CCGTACACAA CCCGATTGGA GCGCAAGAAA GACAAGATGG TGTCCGCGCA TATGTCAGCG 4560 

CCGTTCCGTC CAGGGGGGGT GGCAGTTACA AAGGCCTTCA TGTCGTTTTC TCGGATCTCC 4620 

ACAGAAAGCA TTGCATCATG TGCAGGGATA CGTTCGAATG AAGAAACGTG CACGTAGCTG 4680 
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TTCGTAGCGT TTTTTATCAG CACCTTGAGA GTGTCAGCGG GAGGCAGGGC AAGGCCGCGC 4740 

GCGCGGAACT TTTCCTGGAC GCTGGCGAGT GAAACCTTGC GCCCTTTACC GAGGGGAGCG 4800 

GTGATTTTTA AAAAAACGCC CTCTTTTTTG CAAAGTACAA AnGCTGCGCC GTCGTGTCCG 4860 

GTGTTGGGGG AAGAAGACAC GTCCCGGTGT GAATCGTGCG GTGCAGACAC GTTCCCGGCA 4920 

GCGAACTCCA GTGAAGAGCT CTCGTAGGCG CGGATTTTCC ACTTTTTTTG GCGGAAGGAA 4980 

AAGAAACTGC CAGCGCCTCT TTCGAGCACC TCGTATTCAA CGCGGTATTT CGGTATTCCT 5040 

AATTGAACAG CAGCAGCCTC AAGTGCCTTA TCAAGTGTTT TTGCGCACGC ACTGACGCAG 5100 

ACACGCTTAG AATCCTCCTC GTAGCGTTTC TGCATATCGC GGCGAATTTG ATCAAAGCGA 5160 

GTATTCATAG GGAGTATTAC CGGATACCCT TTTTGATGTT GGTGAGCTTT GCCTTTAACT 5220 

TTAAATTCGC GCTGGTGTGG ATCTGAGAGA TACGCGACTC GGTCACTTTG AGCACCTTGC 5280 

CAATCTCCTT TAAGGTCATT TCTTCGTAGT AGTATAGTAT GAGCACCTGC TGCTCGCGTT 5340 

GAGAAAGTTC CCTAAtTgcC TCTGcgAtGa tAcGctTgat TTcCtCgcgt TcgaCaATGA 5400 

CGTCGGGATT GAGAGAAGCG GGCGCTTCGA TGCTGTCTCC CACAGAGACG TGGTCTCGCT 5460 

CATCTCCACC AAACTTCGAA TCGGCAAGGG AAATCACGCT CGTGCCGGAC ACCTTCAAGA 5520 

GGAGCTGGTG GTACTCTTCA AGCTCAATAT TCAGCGCGCA CGCGATCTCA GTATCTGTGG 5580 

nCAwsACcCC AAGGCGTGCC TCTAGATCTG CAATCGCTTC TTCTATCTGG CGTGTTTsTG 5640 

ACGCACCGAC CGGGGAACCC AGTCGATGGA GCGCAGTTCA TCAAAGATAG CACCGCGGTA 5700 

TGGCGTAACC GCGTACGTAT TAAATCGAAT GTTTTTTTCT GGGTCATATT TATCGATAGC 5760 

GTCAAAAAGA CCAAAGATAC CGTAGCTTAC GAGGTCATCG AACTCAACGT TCCCCGGTTT 5820

CCCAACGGCA ATTTTGCTTG CAACGTATTT GACCAGAGGA GCGTACTGCA CAACAAAGTA 5880 

CTCGCGTATT TTCGCGCTAC GnkTCCTCCG ATACTCGAGC CAAAGCTCCT CTTCCGACTG 5940 

CTGTTCGAAG GCTGTGTTCC CCATTCCCGT GCCCTCTACT GTTATAGCTG ATTTCAGAAT 6000 

GAAAATACAA GCCGGCTCGC TAGGTGTCTT GGGAGAGCAC GGTTCGGATG GCGCGTGCCA 6060 

TGCGTTTAGT CTCGGGTAGG TCAGTGAGAA CGTCACCGAC GAGATCAGAG GTCTTGTCCT 6120 

CGAAGGAAGG AGTAGACGGG GAGAAAGAGT ACCCCAGCTC GCCGAGCTCT CCTGTAGGGA 6180 

TGAGCGAGTC AAAGCTGTCA TCAGTTTCCT GTACGACATC GCCACCCGGC GACGCAAACG 6240 

AAGGCTCGAC CACGTCATCT AGCGTCAGAT CCACATGAGG GATGGGCATC TTCCCATCTT 6300 

CTTGAACAAG GAGGTCGGGA ACCACATATG CAAGAAGGGC CCGCAGCgcG TACGCGGTAA 6360 

CTCCTGCGCC TAAAGCAAGG ACAGTCGCAC GGGCGACCGA CACATACACG CGygCGcGct 6420 
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acCGCCGCTG TTGCAATGGA TAAAAAGAAC GCAGCGCCAG CTGCAATcGC AGGTACCTTC 
AAGGAGGCAC CAACTGCAGC ACAGGAAAAA CGCCGCTCCT GCACGTCCAC GGAGCTAGTC 
TGCAGCTGGT TTTTTGTTTC GTCAAGGACC TGTTTCCAAG CGTGTAGTGT GAGCTATGCG 
CGGCACACCC GCATACCACG CGGTGAGTGG TGTGCCGTGC AGCGCTTGCA CGTGTACACA 
GGTGGCAGTA CAATTGGCCC TCTCTTGGAG GGGGAGTATG GGTCGTTTGA AnCGGTGTGA 
GGTTCGTCGC CGCCCGTGCG CGCTTTGGGC GATCGTACGC n 
(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19217 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 
AGTGTCCGTT TTGTTTGGCG TGGTGTGCGA TCATAGTTGT AAAAAGCTCC ATGCATACTC
GAGCGTCATC CTCTGCTCTG TGTGCTGCGT GTACCGTAAG TCCAAACTGA AGTGCAAGAT
TCTGCAGACG GTACTGATGG CGTCCTAACC CGGGGAACAC CGCTTGGGCC ATCGCGTACG 
TATCAACTAC TTTGTGAGAC AGGGGTTGCT TTTTGCACAG GCTGAGTTCT GCATTGAGAA
ACTCGACATC GAAGTTTGCG TTATGTGCGA CGAGTACTGT CCCTTTGATG AATCGAGAAA 
AGTCTGAAAC TATCTCACAA AAGCGCGGCT TATTGACGAG CATATCGTCG GTAATATGGT 
TGATTTTGCT CACGTCAGGG GGTATAGCCC GATCAGGGAA GATGAGCGTG CTAAAGCGCG 
CAATAATACC CTTTCGATCA AACGTTACTG CACCAATTTC TATAATGCGA TCTTCTTCTG
CTTTTAAACC AGTTGTTTCG GTGTCGAAGG CGGTGAATGC AACGTGTTCA TGCACCGCAA 
AAACCCAATC ATATATCATT GCAGATATGT ACCCATCTCT TGTTCAACTG CGGTGATAAA 
CATGTTGCCC GCCTGTTCTG CTTCTTCTAT CGAGGTACTG ACGGTCAGAG GGTGGATGAT 
ATAACACTTT ATTTTTTGTT CTGTTCCACT CGGTCGTATA CTTACAATAC TGCCATCCTG 
TACAAAATAC TGCAGCACGT TACTCTGAGG AAAAGAAAGG GCAGATGTCT GCGCAGGATT 
TTCTGGACTA AACTCAACAC CAAGATATAT ATCCCTCACC TTCATTACCC GCTTGCGCGC 
AATTTGGGTT AGCGGCTGTC TCCTGAGCGT ATTCATTATT GCATTCATGG TGCTTACACC 
CGCGACGCCC GCATAGGTTT TGTTCAGCGT CTTTTCACAA AACAGGCCAT GCGTCCTGAA 
TAACTGGTGA AGGCGATCGA TCAGGCTCAT TCCGCGCAAC TTCCAGTACA CACCCATTTC 



6480 
6540 
6600 
6660 
6720 
6761 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
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TGCACAGAGC GCTGCGGCGT TGATACCGTC TTTGTCTCTC ACCTGAATAC CAAAATTGTG 
TCCGTAACTT TCTTCAAATC CATATACGTA GGAGTACGCT CCTGACTGTG AAATCTTTTC
TGCAGTACCA CATATCCATT TGAATCCGGT AAGGcACTCT ACACACGTTG cGCCATATGT 
GCGTGCTATA CGGTCGCTAA GTGGGGACGT AACAACGGAG CGTACAATTG CAGGACGCGC 
GGGCATATTG TTTTGTTCCT GCAGGGTTAG CAGAATGTAG TCAGTGAAGA GCGCTCCCAT
TTGATTGCCC GTGAGCAACT GCAACACACC GCGGGTGTTT CTTACTGCAC ATGCAAAGCG 
GTCTGCGTCA GGATCAGTCG CCATAAGAAC CTCAGCATGT ACGCGATCAG CATATGCACA 
CGCATGCACC AACGCGGCCG GATCTTCTGG ATTAGGAGAC GACACCGTAG GrAAGTTCCC 
ATCTGGCAAC CGTTGCTCAG GCACGGTCAT AATGGAGAAC CCCATATCCC CCAGTATGCG 
CTCGACGTGG AGTGCACCCG TTCCGTGTAA TGGGGTGTAT GCAATACGCA TCGACTGGAC 
GGTCTCTTTC GTAAGACCGG GGCGAAAAAG CTTTTCCTTT ATAGAGGTGC AGTACGGTTC 
ATCAATTTCT GCATCAATGA TCGTGGGTgC ACTGCGTTTG ACAGGTACCT TTTCCTCAAG 
GTTCACGACA CTCGTGATAG CGTTCATTTC TTCGGTGATA TTTTTTTCGT GAGGATGCGC
TATCTGTGCC CCGTCGTTCC AGTACACTTT GTATCCGTTA TACTGCGGTG GGTTGTGCGA 
TGCGGTGACC ACGATGCCCA CGTCACAGGT AAGATACTGT ACtGCGTAGG AAAGTTCTGG
AGTCGGGCGT GGATCCGAAA AGAGGTAGGC GGTAATGTCA TGTGCAAGAA ACACGTGCGC 
AgcAGTGTGT GCGAACAGAC GAGAATGTAC ACGCGAGTCG TAGGCTATAA CGGCACGGaG 
CGCGCCGCGC GCTGcCTTTT CAGGAAAAGT TTTTAGTAAA TAGAGCGCAA TCGCGTGCGT 
GATCTTTTTG ATCATGAAGG GGTTCATTCT GTTTGTTCCT CCGCCGACAA CACCCCGCAG 
CCCGGCGGTG CCAAACGAAA GAGTTTGCAA AAAgcGCtCT TCGAGCTCTG CTATATTATT
CTGTGCAACA AGATCCCGTA CCTGCTGTGC AAAGAAAGGA TCTGTTTCTT CTTCAAGATA 
AAGACGAGCA CGTTCGAACA ATTGACTGGA GTGCATGAGC GCTTCCTCAC CTTTAAAAGT 
ACTGGACTAT TTACGGCACC ACAGGATAGA GGGGCATTGT AATGGGAAGG TGCTGCTCTG 
TGCAATGCTC ACAAAAAGTG CATGTCTTGA AAAAGTGTAC CAGAGCCACT ACACTGGTGC
GCGTGGGTTC TGCTGTTTCT CCGAAAGTTT TAAAAGGCTT TCGCGATCTT TTACCGGATG 
AAGAGATTGA GCGTGCATTG CTCGTAGAAA AACTGACGGT GGCTTTAAGA CAAATGGGTT 
TTGTACCTAT CGATACCCCC GCGTTGGAGT ACACCGAGGT TTTgcTGCGC AAAAGTGAGG 
GTGACACAGA GAAGCAGATG TTTCGCTTTG TTGATAAGGG TGGAAGAGAT GTGGCCCTCC 
GCTTTGATCT TACGGtGCcG CTTGCGCGGT TCGTTGCAAC GCACTATGCG CGTTTGTATT 
TTCCTTTTAA GCGCTATCAT TTTGCAAAAG TGTGGAGGGG CGAGAAGCCT CAGATGGGTC 2820

GTTATAGAGA ATTCACGCAG gTGATTTTGA TATCGTCGGT TCGGATTCGG TGTGTGCTGA 2880

CTTTGAAATT CTAAAGTCGA TACGGCACAT GTTGTATATG GCTGGTGCAG AACACATACG 2940 

TATTCACGTT GCGCATCGTG GCCTGTTTGA TCGTTTTTTG CGTGCTCTTT CTTTGTCTGA 3000 

CCAGGCTGAG CATATCCTGC GGATAATTGA CAAACGTGCA AAGATGGCGC CGCATGTGTT 3060 

GACAGCTCAA CTTGAGTCGC TTTGCGATCC AGTTCGTGTG CAAAAGATTA TGACGTATGT 3120 

AAGTGCGGGG GAGGTGGACG GTGTTGCGCC GTCGTTTGAA CATACATTGT CTGCCATTGA 3180 

GACATTGACA GGGGGTGTCT CGGAAGAGAG TACACGGcTT AGAAAAATAT ATGAGCTACT 3240

CTGTGCAGTG AACATTCAGT CCTCTTATGT GTTCGATCCA TCTATCACGC GTGGTTTTGA 3300 

TTACTACACC GGTATGGTGT GTGAAACGTT TTTAACACAG TTGCCTCATA TCGGTTCGGT 3360

GTGCTCAGGT GGGCGCTATG ACCATCTGAC GGCTTTGTAC ATGAAGGATG CAGTGAGTGG 3420

GGTGGGTGCA TCCATTGGGT TGGATCGCTT GTATGCAGCG TTTCAGCAGT TGGGAATGTC 3480 

CCGAGAGCAC GTTTGTTTTG TGCAGGCGCT TATCTTCTGT CAGGATAGTG CGCTCATGGA 3540 

TGTGTACCAA AAGCTGTGTT CATACTTTGC AGTGCAGGTG GCGACGGAAG TCTTCCCTGA 3600 

TCCGCGGAAG TTGAGCCAAC AGTACGCCTT TGCAGAGAAG AAGGGGATTA GGTGGGGGAT 3660 

CTTTGTTGAA CAGCGCAACG CCGTGGTGGA GGACTGCCTG CTCGTACTGC GCGACCTTTC 3720 

TACGCGAAAG GACACACGCC TACCTGcGCA CGAAcGgACC GnCATGGgCA GCTGAAGGGT 3780

AACAGGCGCC CCCGCGACTC TAGAGTCGCA TGTTACTCAA TTCAGTGACT AGGTCCGTTA 3840 

TGGAATCCTT GTTCTTCTGT CCGATTTCAG TGATTGAGGA AGCATACTTT TTAACCTGTT 3900 

CTGAGTTTTT GTGCATGGAC GACACGCTGC TGTCGATTTC AGACGTGATG CGCGAAAGGG 3960

CTAGCATCGC CTCCTCTACG TGCTTGCTGT TGTCCAAGAT GGCGCCCGAA TTTTCCTGCA 4020 

ACGTGCGCGT GATCTCGGTG ATGTGCTGTA TAGCGTGCAA CACGTGCACA CTGTCTTTCG 4080 

TCTGTTCTAC CATCGACTGG CTGATCACTT CTTCTTGGGC CTTTATGTCT GTGGTGATAG 4140 

AGAAAATCAG CGCAAACTGA GCCTGAACTG CAAGCGCGCT CTCAGAAACC TTTTCAATTT 4200 

CCGTTTTTAC GTCCCGCAGC ACTGCGGAGA TATGCTTTCC CTGTTTCGAA GCGTCCTCTG 4260 

CGAGCCTACG TATCTCACTC GCCACTACTT CAAAACCCTG ACCTGACTCT TTACAGTACG 4320 

TTGCTTCGAT TGAAGCGTTC ATTGCAAGCA AGTTAGTTTG ACTTGCGATG TGCTGAATTA 4380 

CCGCACCGGC TTCAGCCAAC GCTTCTGAAG CGTGAAGCAC TTCTCTTCCC ACATCTGCAG 4440 

ACTGAATAGT TGCGTCCTGC GCTAATTTTG CCTCAAGCAG CAGGGTTTCA ATGACATTAC 4500 
TATTGTCCGA AAGTACCTGA GTAACTGACT CGATGTTTCT CACCATCTGC TCTACGGAGG 4560

AGGCGGCATC TGTTACCGTA TCTACCTGCT CGCCGATGTG ACCGTTAAAC CGATCGATGT 4620

TTCCGACAAT GTGCTGCACG TGCTTTTGTG TTTCAAAAAT GGTACGGGAT TGGTGTTTAA 4680 

TTTTGTCTTC TGCCTTGTGT GCTTGGGAAA TGATCTCGTT GATAGCGGCG CAGGAAGTGC 4740 

GCGCATTTCT GACAAAACTA TTCGCGATTT TCCGAGCGCG TGCCATCTTT CCACGGGATG 4800

CAATGAGAAA GAAACGCGTG CTCTTGTGCA TCTTGTTCAA ACTTTGGGTG AGGAACCCAA 4860 

ACTCATTTCT ACGCACAACG CGCATACTCG CTGCGGACGC ATCTCCGCTC AACCTGTTTT 4920 

GGATAAAGGT GAGGATGCCT TGGTTTGCTA AATCGAGGTG TCGGCCAATC CTGATCATGA 4980 

CGATTACGCT GAGTACGCAG ATTAAAGACG CGCACGGGAG GTTATGGGTT ATAACCACAT 5040 

GTGCGAGCTC CTGCTCACCG TGCAGAGGGA GTAGGAGCGA TGAAAAGCCC ACGCAGAACA 5100 

TACTAAACAA ACCTGAAACG AGCACGACGG ATATTTCTCT TGAGAGGGAG TAGTACCCGA 5160 

GGCGTGTCTC GCGCAGGGGG ATAAAGTGGG CCCACTCCTG CAGTGTGTAA AAGAAAGGAT 5220 

GGTGAAAGAA GGGAATGAGG CAGAGCGCTG CGCCCACGCT GCTGAGGTAG TAGGACAGCA 5280

AGAGCACATG GCTTGAGGAA AAACCGACTT CTAGGGCAGC ACACACCGGA TACACCACCG 5340 

CAGCTACTGC AGGAGGCAGG GGAGAAATCC ACGTGTACCG CAGAAACGCA GCCTCTGCAG 5400

CTGCGCTATC GTTTTGATGT TTCTTAACGG ACATGAGGAG AACGTAgTAG AGCGTCAgcG 5460 

AAGCACCCAG CATGAGCGCC AGcgCACAGA AGAAGGAGAC GCTGGTGAGC AGAGCGgAGA 5520 

GCATACTCCC GTCTATGACG CCTGCGATAT AGGCTACGAG AGAAGTCGCC GGCACCCACG 5580 

CAACGCTCAT CACGACGGCG CGCCGCAGCA CGGCGAAGTC GGAGACCGCA TGTTCAGTGT 5640

TCATAGGTTA CTCCATGGAA AGGTGGACTA CAGGCGCAgc TGGGTGAGCT GGTTCGTAAG 5700 

CATGTCCATT GCATTCTTAT TCTCTGAAGC GATAATCTTG ACGGAGGTGA CCGCATCCAA 5760

GATACGAGAC GTGCTGGCGG CCATGTCGTG CATAGCCACG GTGATACCGC CTGTGTTGTG 5820 

TGCAAGTTCG TTCATATCCT TGAGGACCAG CTCTCCATCC TTGAGCATGA GCGAGATACC 5880 

GCCGCGGATA CGTCTGCGGT TGCCTGTAAT AGCCTCCATG GACCGCCTGA TGTACTCACC 5940 

CCCTGAAGAC TGCTCCTCCA TGGCATGCAT GACGgTaGAT TCCCGCTGTT TAACTTCTTC 6000 

TGCAAGGACA AAGAGATTGG AAAACTGCTT GCTCAGGCTC TCCGCCtGcT GCGCGATATG 6060 

CTCTATCTTT TCAGTGAGTT GTACGAGGAC TGAAGAAATG ATCTTTCCCT GTTCGTTTGA 6120

TTcCTCTGCT AATTTCCTAA TTTCGTCGGs GaCAACTGCA AACCCGCGTC CGtCCTGACC 6180 

CGCGTGAGCA GCCTCTATCG CGGCATTCAT AGCAAGCAAC GTAGTCTGAC TTGCGATCTT 6240 
CCGTATTACC AAGCTGGCCT GTAAGAGTGC CTTGGAAGAC TTGGAAATAT TCTGGGTAAG
CTCCAGCGAT CGGTTGGTGA GGTTCCCTGC GCGGTTTGTC TCCTCTCGCA ACCGTCCGAC 
ACGCGCGTCG TTGCTTTCGA TAATCTTTTC GATAGAATGG ATTTTCTCTA CCATTTGCTC 
GACAAGGGTG ATGGACCGGG TGACAGTTTC CGACTCTGCC TCGACGCTTG CATTCAGACG 
CCCGACTCCC TCAACCATAG TCCCCACTAT TTCTTTAGAG GAGCTGATGG TGGCAGTCTG 
CGCCGTCACA TTTTCGTTGA CGTGATCAAT CTCCTCTTTG GTGGTGCGAG CTGCCTCCTC 
GAGGTGAGCG ATATCCGTCT CTGCAGTGTG TGCAATTGCG CTTGAACTCT GTGTCAGCTG 
CAAGAAGCGG GAGAAAAACG CGCGATCTTT GTCGCGCAGG GAGCTCAGTG ACCTCGCCAA 
AAGACCGATC GAGGTGCGGG TTTCTCTAGG TACGTCCGCA CAGGTGTAGT CGCCAGTTCC 
GATGCACTCG GCGAGCACGC GCAAACGGGT GATTTCTCGC GAAGTAAGGT ACGCGCACAG 
GAGAAACTCC ATGCAGCTTA CGGCGCTGGC ACAGAGTCCA AACGTACCCG CCATGGTGAG
CAGTTCTGCC GGAGTAATGT TCTGGCTATC TGTGCGTACA AGGAGTGGTG CAGAGACGAG
GAGTAACGTG GCAACGAACA GCGCAATCGC TATGGTAAGA AGCATGCGGG TCCAGTTTGC 
GCGCACAGAA CTTTCGCGCA AAGGCAGGAA GGCAGCCCAC CGCTCAAGCG GACGCGTAAG 
GAGGAAATGC AGCGCAGGAG CAAACAGCAG ACAGCCACCG GCAGAAAGCA TAACATACAC 
TGCCATCTGT GGACCTGTCT CAAACGGGGA ACGCGCGAGG ATAGAGACGA ACGGGACGAA 
GCAGCTGAGG AGAACCGACG CACGGAGCGC CGTCCTCCCA TAGCTGTTGA CGCGCTTGCT 
GAAGCGcGCC TCATCTTCTC CACGAATAAA AGCGGCGCAC AGCgCGCGAt AATGGAGAAG 
CGGAAAGAGG CACAAGAGGT ACAGCACGCT AGCGGGAGAA AGGAGGAGTT CAACGAACTG 
GCCATCGCCG AAGrgAsCCG sCGCAGAAGC GGAGAGAAAC AAGAGTGGCA CCCAGCAACA 
GGAGAGCACG AGCTCGCGTA CAATtACACT GATCGGAGGA GTACTATAAG TACCGGCGTA
CATGGTTGCC CCCTCAGGCA TAAGTTCACA CGAGCGCGCA CGGTAGCATC GGAGAGAGGG 
TCTGTCAAGC GCGCACAGGC CCTCGCCCGA GGGCGGCAGC ATTGCCGTcG GACAGAATCG 
AACTGTCGAC ACAAGGATTT TCAGTCCTTT GCTCTACCGA CTGAGCTACA ACGGCGCACA 
CCGCGCGCAC CTGnTCATAC AGAAAAAAAA AGGTCAAGTT CCTCACGATG CTCGAGGAAA 
AGCAGCACAG GAGAAGGAGA ATGCAGGTCT TGACAGGTGC GCATACTTTG CACTAGGCTC 
CCGCCGGAGC GGTGGGTGTA GTTCAGTGGT AGAGCGCCAG ATTGTGGATC TGGTTGTCGT
GGGTTCGAAT CCCATCACTC ACCCTGTGCG TGCGCAGGCG CTCGTAGCTC AGGTGGATAG 
AGCAACAGAC TTCGAATCTG TAGGTCGCAC GTTcAAGTCG TGTCGGGCGC ACTTGTGGTG 
TCGCTTTCCT CGTATAAATG GTTCTGTTCT CGTCTTTCCT TTAACACGCT GAGGACGTGG 8040

GGGTGCTTTG AGTGGGTACG TTGAGGCGTT TTCCTGCGTC TGTTCTCCAG ATAGCGCTCG 8100 

CGCTCTTCTT GCTTGCAAGT GGTGCACGAG ACCTCGTGCA TGTTGATGCG GGTGTGTTCA 8160 

ACGCGGCGGT GTATTTCCTC GGCGGATTGT TTCGCGGTCA TGTGGCAATC GGTGTGTTAA 8220 

CGCTCGCCGT GTCGCTGTGT TGTCTGACGG CGGGTTTCTT TTTGCTCGTC GATTTTCTGC 8280 

GCCCAGAACT TTCCTGCGTT TCTGCGGTTT TAGCGCTGTT CGTTGTGCTG TGGGCTCTGA 8340 

ATATGGTTCT GGTGGATGTG GTGGGTGCGT TCGGCCGCGG CAAGGTATTG CAGAATGTGT 8400 

CCTCGGCGCT TGAGCATTTG CATCATACTG CTGTCGATCT CCTTGTGCTT GGAGCGCTCA 8460

TCTTTGTGAG GCAGCACACG CGTTAGGCGT TTACACAGCC GAGGATGACA CACGCCGCGT 8520 

CTTTTTGCTT CTTCTTGTGT GCTCTTTAAG GAGGGGTCAG GCGGTGTCTC GTGCCCCGCG 8580 

TGTGTTTTTT AAGCAAGAAA AAGGAGTGGG GTGAATGGCT GTGGCGGGTT CCTTTGAGCG 8640 

GAGTAGATTG CCGTTACGAG CACCATGAAC GCGCCGTCGG AGTGGCGAAG ACAAGACTGC 8700

CCGGTCCCAA TGCGGTTGGA AACGCAGGCA CTTGTACCGT ACCCTGTTCG CTTTGACCGC 8760

AGCCACCATG ATGCGCTGGT GGTCCTGGGC GCTACCGCAA CAGGTAAGAC AGCGTTAgcA 8820 

GTTGCGCTTG CCCAAAAATA TCAGGGGGAA ATTATTTCCG CCGATTCGCG GCAGGTGTAC 8880 

CGTGGTCTGG ATGTGGGAAC GGGAAAGGAC TTAGCTCTGT ACGGGTCGGT CCCCTATCAC 8940 

CTGATAGACG TGTGTGATCC GTATGAGGAA TACAATGTTT TCCGTTTCCA ACAGGCAGTA 9000 

TATGGCATAG TGCCGAGTAT ACTCCGGGCG CACAAGGTGC CAATTATTGT CGGTGGTACG 9060 

GGTTTGTATC TTGATGCAGT GCTGCGTCAG TACGCGTTGG TACCTGTTGA AAGAAATcAG 9120

GyGCtGCGCC ATTCgCTCCG CGgAGCTTCT CTGTCGCATA TGCGCGCGGT GTACTTTTCG 9180 

TTAAAAGACT CCCATGCTGT TCACAACAAG ACAGATTTAG AAGATCCTGC GCGTTTGATG 9240

CGCGCTATTG AGATTGCTGT ATTCCATGCA ACGCACCCTG AGCTGCTCCA GCAGGCACGG 9300 

GAAACGCGCC CGATGATGCG CGCGAAAGTG TATGGCATAC AGTATCCACG CTCTATGTTG 9360 

CGTGCTCGGA TTCGAGCACG CCTCGAGCAG AGAATACGTG GGGGACTGAT AGAGGAAGTG 9420 

GCAGCGCTCC ACAAAGGCGG GGTTTCCTGG CAGCGTCTGG AATACTTTGG GTTGGAATAT 9480

CGCTTCACTG CGCAGTATCT ACAAGGGATC ATTGCTACCC GTGATGAATA TGTCGACCTA 9540 

CTTTTTAGAG CTATTAGCAG ATTTGCAAAA CGCCAGGAGA CGTGGTTCCG ACGTATGCAA 9600 

AGACTCGGGG TAAAAATTCA CTGGCTCGTG CATAsGGAAA ACGGTTTTGT TCTCCGGTGA 9660 

AAAAACGATG ATCGCTCATC GCACCGCTCC ATAGGGTATG TGTGTGGCGA GTCGCTCGTG 9720 
GTACGCACAG GCGTTAGTTC TTTCTCTTTC
GCTTTTTGTT CGCCTGCAGG ATGAAGCGTT 
CCAAAGCAAA cTGCGTGGAT TGGTTATTCC 
GAATCCGCGT ACGAGTATGC ACAAAATTGT 
CCTCCGTGAC AGTATCGTAA GAGTGTGGTG 
GAGCAAAGGG ACCATCAGCT GGACGCTGGT 
TTCGGACGGG GCATGTAAAC ACGCGCTTTT 
AGGCGTTGCA CCAAATATCA CATCGTGTAG 
AGTGGTGTGT cGCGCGTGGC GTATyTTAcG 
GCTGCTCCTG CAGAACACCG GTTTGAGCTA 
GTCAAAGAAT GCACTCCAAG GCACGCGCGC 
GAACTTAAGC AGCTGCTATT TTGCCtGATC 
CTCGAACATG CgCACGCAGT GCTATGAGAC 
GTTAGGTCCC CGGGGTGCAC AGAGCGAGAA 
ACCGAACAGC GTTCTGGCAT TTGCGTGACT 
TCAGCTGTGT ACTGACAGCC GTTTTCACTG 
CCTGGGCGCG TGTCATACCG TCGAGATCGC 
GCGCGTGCAC GTGTGCATCC GTATGCAAAG 
GCGTCTCTTC CGAAATAATT TGGATTCCCA
ACTGTAAAAC ACCAGATAGT CAAGCTGGGC 
CGCGCGCACT TTTTCTTGAG CATGTTTGTG 
AAAACTCTTT CCTTGCAAGG GGTGTACGGT 
TTGATCTAGA AGCCTGCAAA ACTCGTCTTC 
TTTTTTTAGT TTTTGCGCGG TGCGATAGGG 
AAGGTCTCGA GTAACTGACG TTCTCACAAC 
CGCGCGATCT CCATTTCTGA GGGAGCCTGA 
CTCTGCCTGC TGCTTGGCAA GAATGCGCAC 
ATCCTTAACC CGTGCAATAC CGGTTACCAC 
GATGGACTCG AGTGTCTCCG TTGTTgCGTG 
AGCATTTGGA GAAGGGCACA
TATTTTACGT GCCGCGCTTT 
CTATACGACA GGTCGGGTAT 
TGACTTCTCA GTTAcACACT 
CGCTGCGATT TGCGTAGACA 
GACTGCATTT TTGGATGGAA 
ACGCTTTTTG TGGCGAGTGC 
CCGGnTGTtA CGCAGTACGC 
CAGGGTGAGT CTTTTGTGTG 
AACGCGGAGG CGTGGCACTT 
GCGTTGGTGT TGTCCCAAGA 
ACGAAAATGA GCGGTAAAAA
TGTCCGAAAC ATACCGCAGG
AGACGTGCTC ACTGATAATA 
GCCCATGTGG ATCGTTTGAC 
TTCCGCCTAG GCGTTGTACG 
CAGAGAAGCA GAAGCTCATG 
GCGATTGGGC AGTGGCAATA 
ACGCGCATGs GtGCGATAAC 
TGTTACGGAC GATGCGACAT 
ATTCATACTG CGCGACGCAC 
TTCCCCCGTA TGGGTACAGG 
CGAAATAATT TCTAGCGCTA 
CTGTGATAGG CTTTCGAATA 
GCCTCCGAGC GCGCGTATAC 
GAAGCAAAAG CTTTTTCCCC 
GAAGCCTCTA TCAAGCAGTT 
GCTTTTTGCA AGTTCTGTTC 
GAGCACTTTT TCAAGTGTGT 
AAAGCGTAAC 9780

TTGGAGGTGC 9840 

GGGTATATTC 9900 

CTCGTGTGGC 9960 

TTATAGAGGC 10020 

TCAGTCTCTC 10080 

TTATAGGAGA 10140 

CGTGTCCGTA 10200 

TGCTGCGTGT 10260 

TCTCAACACG 10320 

GAGCTACTGT 10380

GTTAAAAACA 10440 

GTGCAGTCCA 10500 

CGCACGCCAC 10560 

ACCAGATACG 10620 

AGCGCGATTG 10680 

CCGTGCAATG 10740 

AGCTTGTGGA 10800 

GTGTCGACTG 10860 

CCCCTCCTAA 10920 

CTGAGAAATA 10980 

ATGCGGAAGC 11040 

CCCCCTGTTC 11100 

TTAGGTACGA 11160

GGTGTATAGT 11220

GGAGTGGCGA 11280 

CACACATATC 11340 

CAAACTGATA 11400 

CAAAACCGGC 11460 
ACAGATAAGT TTTTCTCCCA TGGTTTCTCC GATCCCCTCG ATGCCAAACC CGGCAATAAA 11520

CGTTTGCAAC GCTATTTCTT TTTTGTGGTG TATAGCTTCG AGTATTTTTT TTGCAGTTGC 11580 

ATTGCCGACG TGCTCAATCT CAATGAGATC CTCGCAGGTG AGCGTGTAGA GGTCCGGGAT 11640 

GCGGCGTACC TTTTTTTCTT CAAAAAGACG CTGAATGAGT TCGGTCCCCA CATGCTTGAT 11700 

CTCCAAACAC TCAATCCACC GTGTAATGCG GTGATGCGAG AGCAGGGGGC AGTTTACATT 11760 

AGGACAAAAA AGCCTGCTCC CACTGTTTTC CAGCACTGTG TTGCAACTGG TACACTGTGT 11820

CGGTATGTGG ATTTCCTGCG CATGTGCTGG GGTTGAAACG AGTGCTTCAA TTTTTGGGAT 11880 

AATCTCCCCC CGTTTGGAAA TTAAAACGTG GCTGCCAATT TTGAGACACA GCTTTGTGAG 11940

CATATTCGGG TTACATAAAT TTGCGCGCTT GACCGTAGTT CCTGCAAGGC GCACTGGATC 12000 

GGTAATACCA ATGGGCGTGT ACGTCACTCC TGATGTTTGC CATTGAACGT CACGCAAGGT 12060 

GGTTATCGCC TCCTGTGTAC TGAACTTAAA GGCTATCTGC TTTTTAGGTC TGGGAAGTTG 12120

TGCGTCCTGG AAGTCAAGAT CGGTACTCTT TACTACCAGG CCATCGATGC TGTAAGGCAA 12180 

CAGCTCGCGC GTGCGCATAA TCTCAGATCG GAGTGCAACA ACTTCCTGTG CGTTAGCGCA 12240 

GCGATGCGAA TGTACCGTCA CGAAACCTTG GCGCGCGAgC CAGGCAAGCT TTTCTGTTTC 12300 

ATCAGCAAAG GGGAGGGAGC CGGTGAACGG TTTACCGGGG GTGCCGGGTA cTGCGTCGTA 12360 

ACAAACAATA TGGAGGTGGG TGCGgCCGCG GCCGTCCTTT CGCTTTAGGA TGCCGTTTAC 12420 

GGTGTTGCGG CAATTTGCGT GAGTAGGATA GTGGGAACGA TGTATATCCT TGTGCATAAT 12480 

GACTTCGCCA CGAACACCCC CCGTGAAGGG GAGATTGCCG CAAGGTCCCC ACTCTGCAGT 12540 

GAGGGTCGGC ACAAAGCCAC GCATGCCGCG TACGTTAGCA GTGACGTCGT CTCCGACAAT 12600 

GCCGTTACCA CGGGTGAGCG CACAGCAAAA ATGACCGCGC TCGTATTGCA ACTCTAAgcT 12660 

AACGCCATCG AGTTTGTGTT GGACGAGAAA TGCCTGCAAT GCATTTTTTT TTGCCCATGC 12720 

GCTGAAGGAC TCCTCGTCTG CAGCTTTGTG TTGACTACCC ATAGGAACAA TGTGGCGCTT 12780 

TTTCACTGCG TCACGTTGAC TGTCAGAACC GATTGCTTTA AGCAGCGGAT TTCCAGGATC 12840 

AAGCCTTGCA AGTTCTTCCC AAAGCGCGTC AAAGGCGTCA TCTGAAATAT CAGACTCCGC 12900

GTTGTAGTAG CGGTCTTGAT GGTGAAGAAT GAGCTTTTCA AGTTCTTGAA CACGTCTCTG 12960 

CGCAGTACTC ATAGCACAAG GsCGCAATGT GTAGCGTCCG GGGCGCACCT CCCACGCGTG 13020

CAACGCTCCT TCGTCTTGTC GGAAGATGCC AAAACGGCGG CGCAGTCCCC TGGGGGCGTG 13080 

AGTGCAGCGG TCACTCTAAT GGCTCTGACG CCAGGTGGAG CGACACGTGG CAGTAACCAC 13140 

CAGGATTCTT GGTCAAATAG TCTTGGTGAT ATTCTTCTGC AGGGTAGAAG TCACGCGCCT 13200 
CTTCCAAAGT GGTAACCAAA GGACGGGTGA
GCGTCTCCGC TTGCTGCTTC TGCGTACCAC 
TGCCAACGTC TCCTCCCTGT TTGTTAAGAC 
TGAGCAGATC CTCATAGGAA ATAACCTGGG 
GTCCGGTAGT ACCGGTGCAG ACGTTTCGAT
ACCCGACTCG AACGCGCAAC ACTCCCTTGA 
AGcActGCGG CGAAAAGAGC AGTCCCCTCT
TTGATGCAGT AACGGAGCCC CCCGCGTTCG 
TGAGAATTAG CGTTCCGACT CCGCACCTCA 
TTTTCAATCA CCGCGCGCGC GTCTGCAGGT 
TCGAATTTAT CTTCGGAGAG GAAGAGGAGT 
GGTTGGTAAG AGTTCCAGAA CTCATTCTTA 
ACACGGTACT GAATATCACT CAGGTGCTGA 
ATGAGAGCCA GACCTGCAAG GcACCTCCTC 
GAAACCGGcG GAAGCGTGAT TGCGTTGCAG 
TCCCTACGGC AGATGAACGA GAGGACGTGC 
CATGGTTTTG cGTGCTTCAG CACAnTGcGC 
TACGCAGGCT GCTGTTCTAA CTGTGCACGG 
AAGGCGCGCG CAAGAGCGTC TTTCACTTCT 
TCTTTAAAAT GTGCAACCTC GTCCGTGTTT 
GGATTCCCTT CCACGCGTCC TGGTATATCT 
GCACGGACCT TGCGCCGCAC CGTCTCTTCA
CTTTTAGACA TTTTCGCTTG TCCGTCAGTT 
GCACGCGGCA ATGGAAAGAC CTCCCCATAG 
GTCAGTTCTA CGTGACTTTC ATTATCTTTA 
AGAATGTCCG CCGCTTGAAG TACGGGATAT 
TTTGCCGCTT GAGCCATCTC CTTTAAGGAA 
AGATTCGCAA AAATGAACGC CAGCTCTGTA 
ACCGCACGCT GCGcGTCGAT GCCACAAGCT 
ACTTTCCCGC GCCTGCATAG
TGAGGTAGAA AATTGCAGAA 
TAGTTGGGTC ATGCATGCGG 
GATCAAAGAG GATCTCTACC 
AGGTAGGACT TTTGGTGGTA 
CGCGCCGAAA GTAGGCTTCC 
TCCCGAGCAA CAAAGCGCAA 
GGCGGACCGT CTTTGAATAC 
GTACGGAGCA TGTTGTGCGT 
GCAGAAAAGC TAGGCCAGCC 
TCTCCTGAAA CAACATCAAG 
AAGGGAGTCT CAGTGGCAGA 
AGGTTTGCAA GCATGGGGCC 
GGCGGCAGGA CTCGGCGCAC 
GCAGCCGTGT TGATGCGCGG 
AGAAAGACCC ATCGCCTCTT 
GCGCAgcCTT cCTCTAATAC 
CGTGCGCGGA TGGGTTCTAA 
GTGTCTCCCA CGCGCCCAGc 
GGGTTGAACG CATCGTGGTA 
GCCCGGGTGC GCGCAGGATC 
TCGTCCGAGA GAAAAATCGC 
CCCACGAGAG TGsCGCAGTC 
AGGCGATTGA AACGCTTTGC 
CCTACCGGCA CAAGATGCGC 
CCCAAAAGAC CAAAGGGAAG 
GGGATGCGCT GCAAACGCGG 
ACTTCTGGCA CCGCcGATTG 
AGGTAGTCCA GCACGAGCTC 
CGACCCATGA 13260

CGGTATTGCG 13320 

AAAAAGTGCT 13380 

GCTTCTGCGT 13440 

CCGCCCGTGT 13500 

GTACTCCAGA 13560

CGCTGCAGAA 13620

GTGCCCGAGA 13680

TCGATCCTCC 13740 

ACAGCCTGAG 13800

ATAGAAACCG 13860

CTGCTGGGTA 13920 

ATTCTAGGCG 13980 

TGGTAAGAGG 14040 

CTACAAGCGT 14100 

TTACTCTCTG 14160 

GCGTGCAAGG 14220 

AAACGTATTC 14280 

GrATTACCGC 14340 

CGCAAACACC 14400 

TGTGTACATC 14460 

GTTCCCCAAA 14520 

ACTGAGGAGT 14580 

AACTTCGCGC 14640

CCGCGCCAgC 14700 

TTCAGAAAGG 14760 

CACCGTTACC 14820 

CAGATAAATG 14880 

ACGAACGAGA 14940 
GCAGGCAGCT CGGCAAGCTG TGCACGCCCA
ATGATGAAGA AACACTcGTG CTCAGACTGT 
TAGTGACCGA GGTGGAGACG CCCCGTGGGC 
GACTGTAAAT CAAGGTCCGT CCAGTATTCA 
CCGCGCAGCC CTCCTCTATA TCACGGGTGC 
GCCGTAGCGA TTTATTGTCG CTGCCATATC 
AAGTTCGCAG GCTACACGCG GCATTCCATA
TGTACGCGAG CCTTCTGTGT ATAAACGCGc 
GTCATTAAAA TTCCCAATGC GCGGTTTTCA 
TCAACGCTGG GTCGGTGACC GTTTTCCGGT 
AGGGAGCGCC GCTCTACCGT CAGATGGCGA 
CGCACCAAGT CTCCTTCTTG CGCTTCTTTT 
TACGCAAATT CACGCGTGAA ACCTGCAGGC 
AAATCTGCAT CGAGCTGTGC AAAGATATGA 
CCGATTGCAA TGATCTGCAG TGCCCCACTT 
GCTCCCTCTG TGGGGGTGAT TGTGTAGGAT
ACAGACGGCA GAGAGGCTCG CTCTTGGGTA 
GGTCTCTCAA CAGGTGTGTC CATGGCAAGA 
CGTCTCCCAT ACGCGGTAAC GTAATCGACA 
GACTCAGATC CAAAAGGCTT GGTGACAAAG 
GTGACCCGTG CACCTTCTTT TGCAATGCTA 
AGACGTTTCC GCTGTTCAAG AAACTGAAGC 
AAGATGACAT CTGGCTGTAC ACGCTCTAGC 
TTGCcCGCTA TGCTAAGACC CGGCGCTCCT 
AGCGCAGAAT CGTCCACGAT TAAAACAGCA 
GTGGCCCCCT GCCTGTATAC GCTCTGGTGT 
TGACCCTTCA TCGTTTTTCT GGCACAGnCA 
TGTATTCATC CCAAAGAGAG ACTCAGAGTG 
CGCATCCCAA AATCGCTCAA TAACCGCCTT 
TGGGTGTTAg TGAGCGTGTG
ATGCGCAGAC GCGTcCCGAG 
CGGTCTCCGG TAAGGACGCG 
AGAGTGCaGG CGCaGGAACA
GTGTCCGTCT CAGGAAGAGG 
ATCAAGACTC ACCTGTTCGC 
TACAATGCAC GAATCTGCGT 
AAsTGCGCGG CACCGTCCCT 
AAATGGCGAG CTACTGACTC 
TCATCCGAGT TGATGTGCGC 
TCCCCGGGGG CAATCAAAAC 
ACTTCCAGCG CACATACCTG 
ATGTGCTGCA CTACTACCAC 
CGCAGCGCGC TAGGTCCTCC 
TCACGCAGtG GCACGATGCG 
GCACGGTcAC GCGCGGGCGC 
TCTAAACAAT TCAAGTCCTC 
CAGCGGCGCG TGCGGCGCAG 
ATCTTGCGCG AAACCGTGCG 
TCACTTGCCC CCAGCTCCAA 
GAGAGAATGA TTACTGGAAT 
CCGTTCATGT GCGGCATTTC 
ATGTCAAGmG CAAAGCGCCC 
TCAATGACTT TTCCAATAAC 
ATATCATTAG TATTTTTCAT 
GCAGCCTTAC CGCGCGTGGC 
GCCCCACGGG GTTTTTAGAA 
CCCAATGAAA AGAAAAGAGT 
TTGGGCTGTT TCATCAAAAT 
CAGATCCGCG 15000

CGAGCCTGCA 15060 

CAcATGGGGA 15120 

CGTGCGCCGA 15180 

CAAACACCTT 15240 

CAACTGCTCC 15300 
CCTGCGCAAT 15360

TCCCATTCCT 15420 

AAATAACACA 15480

AACAGTTGCC 15540 

ACGCCCACGC 15600 

GTTAAGACTG 15660 

CGGCTGCGGC 15720 

CGTTGATACG 15780

CGTTTGGCGC 15840 

ACGTGCACAT 15900 

CTCGCCGGCA 15960 

CAACTTGTAG 16020

CAAATGCGCA 16080 

ACATTGCATC 16140

ATCAATGCGT 16200 

CAGGTCGAGC 16260

ATTCATGGCC 16320 

CTTTCTCATG 16380 

ACGGTAGTCT 16440 

ATTCATGCGC 16500 

AAGAAAACTT 16560 

GAGCAGACAT 16620 

AAATAAGTAC 16680 
GTTTCTGCAG AACAGCACGT CGACGTTTCG ATGCATGGAA CGGTGTTTTA AGTTATGATA
ATCGAAGCGA ACCATTTTCC TAATATCGGC GTTAATTTGA TATCCTTCCT GTGTTTCGCG 
AAAATATGCA CGGAGGTATT CGTCCGGTAC TCCAGAGAcC GTGCGCGCGG GTAGTAGCCC
TGGCGTGCAA CGAGAAGCGA TTTGAGTGAG AGATCCGACG CGATCACCTG GCATGAGAAC
GCAGCGGGCG CATACCGTTT CAGCAGCATT GCAATAGTGT ACGGCTCTTC CCCGGTGGAG
CACCcTGCGC TCCAAACGGT TATGGAATGC TCACCtnCTG CGCTTGGCTT TTACCAATTC
TGGAATGACA TAGTGCGAGA AGGAGTCAAA ATGGGCTTTG TTACGGAAAA AACGCGTTAG
ATTTGTGGTT ACCGAATCGA GAAGTGCAGA AAGCTCTGCG CTACTTGCAA GGACCTGCTG 
GTAGTATGCG CACGCAGAAG GGAGGGCAAG TTTCGCGCAG GCGCGATCTA ATTCTACTTT 
CCAGTACCGA GCGATTAAGT GCAGAAAAGG TGATGCCgCT GTGCTCGTAA ATAAGAGTTT 
TAAACGCGGC AAATTCCGCA TCGCTGAGTG TGCTCATCGG TCTTACCCCC TTGCCGAGTG 
TCAGGTAGCT GCGTGTTATA TCGCTTTTTT CGGTCGCAGC CGTCATCCTG CACACGGGGT 
AATTGCTACC GGGTGTACAA CGACAGTGAG AACCGACACT GTGTCTTACG GAAGCGTGCC 
TGGCACCTCG CGCGCCGACA CGGTCTGCTC CCTTGAAGGT GCACGCGCCG CACGGTGTGT 
CAGGAGCGTG ACGCTTGCAG AGATCATGCG CCGTCTCACA TGGCACGGTA CGGTGAGTGA
CCCGGGCCTG TCTGTGAGGC TACCTGCTCC CAATCACCTT GCTGAGGCTC ATACACCAAA 
AAATCTTGTG GCCGCTGTGC GTGCTGTTCG AGCGCCCAGG GATAGAGCAC CACcTTTCCT 
TCAGAATCAA CAAACCCAAT GGGGTCAAAC TGGTAAAAyT tCGCCTTAGG ATAACGAGCG
CGCACATGGC TGTGCATTTC AAGGGGAATC CATGCGCGCG GGATGCACGT CCCCCACACC 
ATTTCCACCT GGGAGGGAAG CGCATACGCC CATATGAGGC GGGTGCTCAC GTCCAGTGCA
AACAAGCACA GCGAAACGGA GGTAATCGgC GTTCGCCCTT CGAGACAGTA AAAGTGAAAG 
CGCTTCAACA GTGGTTCGTA GTAGCGCTCA TGAGCGTGAg CATCGAGCGG ATGGACAGCA 
CGGTGCACCA CGTATCGTGC GTGCTCAATG GTATTGAGTT GTCCCCAAAA AACAGTTTCA 
TGCTCGTTCC CGAAAACAAA AACATCCGGT TTGACGCGCG GCAGGGCGCG CGTGGACACG 
GCGTGGGACA CCCGGTGTGC TTTTCTTTTT TGCAGGGAGC CGGGAGGACG CAGACGGAGC 
CTCGCTCCAC ACTCCTTCTT CAAACCAGTA CGACGGGCAC GTTACGCGCA CTGATAGACA 
TTCTTAGCAC GCGCGCTGCA AAATCAGTTC CCTCAAGAAG AAAGTGTGCG GGACCTCTTG 
CAGCGTCTGC GCACACACCG ATGTGCCCAC CACGCACCAA AGCCCGCGCA AAAAACCGGC 
CATTGTCATG CCTAGATCCA CCGCTCCTCG TTCTGACCTC CTGcGTGTAC AAGACCACAA 



16740 
16800 
16860 
16920 
16980 
17040 
17100 
17160 
17220 
17280 
17340 
17400 
17460 
17520 
17580 
17640 
17700 
17760 
17820 
17880 
17940 
18000 
18060 
18120 
18180 
18240 
18300 
18360 
18420 
ACGAGGAGCG CCGCTGCCAC TCTAGAAGGG AGCGGGTACT TCTGTCAAGC CATTGCTCTT 18480

TTTTCTCACT CTCCTTTGCG CGCCTGCACC GTGCTGCTGT GGAGCGCGCA AAATAACCAG 18540 

CATCTGTTGA CTTAGCGGCG GACGCTGTGA TGAGCGTGTT CTTTAACTGC GCAAGTCGCT 18600 

CCGTGAGyGA AACCTTATAA ACGTGTGGAT TTAACAGTCG CACATAGCTT TTATCCAATC 18660 

GTTGCAGTTG TGCGCGCATC CGTGcATGGA TTTCTTGTAC TGaTTCGCCG GCACGTAACC 18720

GATTGCCCGC CTGcATGACC GTGCGCAAAA GCGGTTCCAC CTGGTGCGCT GCGAACGTGA 18780 

ATGATTGAAG ATTATAAAAA GGATGTATGT ATCGCTGTGC CACcTGCGGC TCTATCACCT 18840 

CATCTGCAAG TCCTATAACA TCCGCCTTGT ATTGCCCCGC TGCATCATAC AGACGCCATA 18900

CCTGcTTTAT TCCAGGTGTG GTAGTCTTTG CCGGGTTATC CGAGACTTTC ATGACTGGCA 18960 

GCCAGTGTGC ATCGTCCCAG TGGGGGAGGC GTTGGGATGC TGCATGCCCG TGTTGAGTGT 19020 

GCGCCGGCTG TGTCGTAGCG CGTGCACTCA TCTTGTACAC TCCGGTAAAG GCAGAGTCTG 19080 

CTCCTCCGGT TACCAGGTGT GTGCCTACAC CCCAAGCATC GATGGGAGCA CCGCTTAAAA 19140 

CTAAAGATTC GATGATCGTC TCATCCAGCT CATTTGAAAC TGCAATGCGT GCTTCGGGCA 19200 

ATCCCGCTGC GTCTAGT 19217 
(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3496 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20:

AAAGCATTTG CAAAGACACG TACTGAGTTG GGACAGACAA TAGCAGCTGC TTACCTCGCA 60 

TCAAAGGATG TGTTAACCGT GGGTATTGGA CTTGATATGT ATCAAACAAA TCAGTATTCA 120 

GCTCTTTCTG AGCACATAGA AAAAATAGCG GGGGATAATA AGTTTGGAGC TCTACAAGCA 180 

AAGGCAAGGC AAATTTTAGC ACGTCAAAAA AAAGAATCGT GAGGATGTGT TTTCTATAAA 240

AATCTGTGTA TTAACTGCAG TGGTGTGTCT GCTGTCTTGA ATTCTTTTTT GACGGTGAAT 300 

ATGGAAGTAC TTCGTGTAAC CAGTTTAACG AAACATTATG GCTCCAGGCG CCATCCGGTA 360 

CGTGGGTGTG AAGACGTAAC CTTTTGTGTT GAAAGGGGAC AGGTGTGCGG GATATTGGGG 420 

TTGAACGGTG CAGGgAAAAG CACTGTACTc GCGTGATTGG TGGGTTGATT CATCCGTCTT 480 

CGGGGGAAGT GTATGCGTGT CATTGTTCTT TATCACGCTA CCCGGTAGGt ATCGGCGTCA 540 
TATTGGTGTT CTGCATGAGC AGAATCCGCT ATACGCAGAT ATGACGGTTG AGCAACATAT 600

TCTTTTTGTT GCCCGCATAT TTCAACTTGC CGATGGGGAG GCACGCACCG CTGAAATGAT 660 
AGAATTATTC CAGTTGCAGT CTGTTGCACA CAGACGTGTG CGCAATCTTT CTAAAGGATA 720

TAAACAGAGG GTTGGGcTTG CGCAGGCATT GGTACACCGT CCCAAACTCC TTGTCTTAGA 780 

TGAGCCTCTT TCTGGTTTGG ATATTGTATA TCTGAAGGAA TTCCATAAAG AGATTGTTGC 840 

GCAAAACAAT AATCTTGCTG TGGTGTTTTC TACGCACGCG GTGCAGGAGA TCGAAGCGTT 900 

GTGCGACGTG TTTGTCTTAT TGCATGCAGG ACATGTTCTT TTCTCAGGAA ATAGAGCGCA 960 

AATAGCAGCG CGCATCGTGC GAGAATTTCC TGAGAAAAAG CAAACAGTAG CATTGCACCT 1020 

TGAAACAGGA ACCTTTATCG CTTTTGTATT TGAGCAGTAT ATGCAATGGC AGAGTGCACA 1080 

GGATGCTGCG TGCTATGCAG TGTAAACAAT TTTTTACTTT GTATAAAAAG GAGCTGCGTT 1140 

CTCTACTCAC TTCACCGGTA ACTTACGTGT GTCACGTACT ACTGCACCTT GGTCTGACCA 1200

TACCGTTCAT TGGAGTAAAT TTTTGGTTAA ATGCGGGGAT ATCTGAGCTT CAAAGTTTTT 1260 

TTCTTAATGC ACCACTTCTT TTCTGCATTA TCATACCGCT GCTGACAATG CATGTATGGT 1320 

CTCATGAGCG AAAGTCAGGA ACCGATACAC TGCTTTTTTC TTTTCCGATT GCAGAACGAA 1380

CGATTGTTTT GACAAAGTAT CTATCgCTGC TTTCAGTGTA CGGTGGGATG ATTGTTGTCA 1440 

GTACTGCTAT CCCTCTTTCT ATTTTTTCTC TGGGATATTT TGATTATGCA CCCTGTGCTC 1500 

TTGCATACGT GACGCTTGTT CTTTTTGGTG CAGCTCTTCT TTCGCTGTCT TGTGCGGTAG 1560 

CCAGCTACGT TTCTTACGCT GCAGTGGGTT TTGTTTTGAA CTTTACGCTT GCGGTGATGG 1620

CATTGCTGGT GCATATTCCC GCACGAGTGT TCATATCACA CAGATATATA AGGGCATGTG 1680 

TTTCGTGGGT TTCTTTCGTA TATCATTTTG AATCTGCCGC TCGTGGCATA TTCGATTTAA 1740 

GCGATTTCGC GTTCTATATT TTTGTAGCGA TAGCGGGTAT CGAGTTGCAG TGTTTGATTG 1800 

TAAGGGTTCG TTTTAGGTGA GCAGAAAACA TCATATACCC TGTACCGTGA TGATTCTGAA 1860 

TATAATGATG AGCGTGTTTG TGACGTTCTG TACACCTGTC CGGTGTGATT TAACAGCACA 1920 

GAGAGCATAT TCCCTTTCGG CACACACCAT TAAGCTTTTT GAGAGTGTCG AAAGTACTGT 1980 

GGAAATAACG TGGTTTTATT CCACCGATGT AGATAGGTAC ATTCCTACCG TCATATATGT 2040 

GAGAGATTTG CTTAAAGAGT ACGCTCATCA GCTGAGTAAG CAGTGTGCAG TAGCGATGAA 2100 

GGATATTAAT CTCCTTTCTC AGTCTTTGAG GAAAGAACTT GGATTTGTTG CTCGGCGCGT 2160 

TACGTATACG CGTAACACTG CCAGCATAGC GTACGATGCG TATTCTGCAA TACTTGTTGA 2220 

ATATCGTGGT ATGGCTCGTG CCGTACCCTT TGTGTCTGAC ACCAAAAGGC TGGAGTATGA 2280 
CATCGCGCGT TTGATCATCC AGATGCAGCA GGAAATGAGT GCAGATATGA
GATATATGTT CTTGCTCCAC CAGAAAGTTT AAGTACCACA TATGCCCATG 
TTTGCAATCT GAAGGATtGC TCCCAGAGAT TCTCTCtATT TCTTTGCCTC
CCGTATTCCA CTTTTGaTTT TAGGTtCyGG CTACGTGGaT GAACACGcCG 
TGATGCTTTT TTGCAGAAGG GAGGAAACGC ATTGTGCTTT GTATCcAGGA 
AACTCAATGA TCAATGGACT GTTGAGGAAA AGCGCCATGA TTTTCTTATT
GCACGTACGG AATTACTATT AACTCAGATC TCATTCTCGA CGAGCAAAGT
CGTTACCTTC AGTTTACGAA ACTCAATACG ATAGAGTGTC TTATCCGTTC 
TTACTTTGAA ACCGTATACG CACGGAGTAC CTGTAATGGT ACAAGCGGGA 
TTCGATTATT TTGGCCCTCG TCAATACGAG TTTCTTTTCC TGCCCGTGTA 
CGAGTAATCA TTCTCTGTGT ATGACTGCGC CTTTTAATAT TGATCCTTCT
TGAAAGATCT TGCAAAAGGT AAAATGCCCG CTCCCCAGGC ATTTGTTGCA
ACCCTGGAAA GCTCATGGTA GTGTCCGATG AGTACATGGT CAGTGCAATT 
CGCACAACGG AGAAAATCTT GATTTCATGA TAAACTGTAT TCAGTGGCTG 
ATGGTTTACT TATGCTGAAA AGCAAGAATC CCGCGTGGCT TCCATTGAAA 
ATGAACAAAA GTTCGCACGC ATTGTGCACC GTGCGCGCTA TCTGAATATC 
CTGTGCTTAT AGGAATGCTG TTTGTGGTGA TGCAGATTCT TTATCGGAGA 
GGTTATGCGA TCTGTGGATT CGCGTAGCAG CGTAACACGG TGGGTATGTT 
GATTTTGTTT TGCTTTTGTA TTGCGGTGAT GAGGTATGGG GGAGTAAAAA 
CTTTTATGGA TTTTGTCTCC ACCCTAGAGA ACGGGCGGAT ATAACGGAAG 
TTTTCCAAGG GAGGAA 

(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11628 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear



TGTCCCGTGG 

TATTACCGCG 

AGCTAGATAC 

TAACCTTACT 

AATAGCGTGC 

AATCTCCTGA 

TTTGCTGTAT 

TGGCCAGTTG 

ATTCAGTTCC 

TTTGAGTCTA

GTTGATCACC 

TTTCGTGATT 

GTGGAACATA 

TGTGGTAACG 

TCTTTCCGTG 

GTAGCTATCC

AAACGGTGAG 

TAACCTCAGT 

AGAGGCGTTA 

TCATTCTCCG 



2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3496 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21:
GTTAATGTGG AAATGAATTC ATTTCCAAAA TTCTCCGCAG TGACGTATAT GACGTTCAGG 60 
TCTGTTGTCT TGTAGATCTC GTGTCCAATA GCCTGCATAA GGTGGGTTTT TCCTAGTCCC 120 
ACTCCACCGT AGATAAGTAA CGGATTGTAG GAAGTGCCTG GGTTTTTTGA TACGGAGATA
GCAGCGCTAT GGCTGAATTT GGTTTCTTCT CCGGATACAA AGTTCTCGAA GGTATAGTCT 
CTGTTCAGGT CGGGGTGAAA GCTCTTTTTG GAAGGAACCT CTGCAGGAGA GTTTTTCTCC 
AGGTAGGTAT GCaCGTGTTT GGGGGGAGCA GTATTTCCAT GAGGGGTGCC TTTTTTAACG 
GCAAACAAAA GTTTAATGGG GTGTCCAGAA AGTTCGAGGA ACTTGCGCTC AAGCTTTTCT 
TGATATTTTT GGCTAAACTG TATTCGGAAA AAGTCTGAAG GTACTGCTAT TTCGATAGCG 
TTTTCAAAAG ATGCGATAAA GAACAAATGA GCAAACCACA TGTTAAATTC TGCTTCGGTC 
GATTCACTCC GTATCTGGCT GAGTGTCTCG TTCCAGAATA CTTCATACCC TACTGCGTCC 
ATCTACCTAT GATACAACCT ATTGTATTTT GCCTGCAATA AACGAAGAGG TTATACGCGC 
GTTGCTTTGT GGGTGTAGAT TATCTTGTTA TTCAAGAGAA GTTTTTATGC TACACTAAGC 
GGCTCTTGTT TAGTGTGGGG CTGTTGCGCG ACAGTATACC GTGAGCATGC CCGCGAGAAA 
TGGGGAGTCG GAGTGGTTAT GAGGTGTGAT GCTACGCAGG AAAAACGTGC GCACTCAGAA 
TCAGGGGAGA GTGTTTTTTT CCAGAAGTTT TTGGAAACGC GGCAAATTCT CCTTTCAGGG 
GAAATAAGTA AAGACCTCGC AGAGGGAATA GTACGGCAAC TCTTTGTATT GGAGTCTCTT 
TCCGTTTCGA AGCCCATCTA TATGTACGTG GATTCTCCTG GGGGGGATGT GGATGCAGGG 
TACGCTATTT TTGACGTTAT TCGCTTCGTC AAGACGCCAG TGTACACAAT TGGAATGGGG 
TTGGTTGCGA GTGCTGGTGT ACTCGTTTTG CTCGCGGCAA AAAAGGATTG TAGGTTTGGA 
TTGCGCAATA GCCGGTACTT GATACACCAA CCCCTTTCTG GTATGCGTGG CGTTGCGACA 
GACATAGAAA TCCACGCACG GGAGCTTGAG AAAACGCGAT CGAAACTGAA CGCTTTGATC 
GCAAGTGAAA srrGTGTGAG CTTAGATAAA GTTGCACAGG ATACAAATCG AGACTACTGG 
CTCGACGCTT CTCAAGCACT AGAATATGGT CTCATTTCGA ACCTGATTGA AAAAAGGGCG 
GACCTTCCTA AGAAATAATG GATACCGAAT CTGTCCTCTT TCGCGCGCAG TGCTTGCGTG 
CAGTGCGTGA TTTTTTCCTT GAACACCACT ACATAGAGCT CGATACGCCT GCACTCGCCC
GTGCGCTCGT TCCAGAACGG TGTCTTGAGG TGTTTCAAAC CGAGTACTTT ACGTCAgTGC 
ATGCTAAAGA TACACAGAAG TTATATCTCG TTCCCTCTCC TGAGGTTTTT CTGAAACCGC 
TCATCGCGCA ACTGCAACGT TCGGCTTTTC AGATCTCAAA GTGCTATCGC AATGGAGAGT 
CCATGGGCGC CTTGCATAGG CCGGAATTTA CTATGGTCGA ATACTACACG GTGTACGCTG 
ACTACAAGAC GTCGCTCGAT GTAAGCAGCA AACTCTTTCG CTTTGTGGTT GAACAAGTAC 
AGAGTCATCC GCTCGCGGAC CCATATTCGT GTGCTTGTTT TTGTGCTCCC TTCGAGTACG 



180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
TGACGGTCGA GGAAGCTTTT CTCCGCTATG CAGGCTTTTC CCTTTCGCAC GCGAGTAGTG 1920

TACAGACGCT TGCGCAGGAA GTATTGCGCT CCGGAATAGA CCTGGGAGCA CGTGCGGGGG 1980 

TCGATTATAC CCAGTGGTCA TGGGACGATT TGTACGAACT GTTGCTCGTG CATATTGTTG 2040 

AACCAAAGTT GAGGTCAATA AAGGATCGTT GCGTCGTGCT GTATGACTAT CCTATACAGA 2100 

TATCCTGCCT GGCGCAnGAA CACACTGGAC GCTCAGGGAT ACAATCTACG TCACCTAACA 2160 

AGGGTGACGC ACCTCACTGG GTGGTAAAGG AACGGTGGGA ACTGTACGTC CGCGGTGTGG 2220 

AACTCATAAA CTGTTACACA GAGCAGCGGG ATGCGAAgcA TGTTACCCGG TACTGCAGGG 2280 

AAGAACAAAC CGCAAAACAG GGATCTGCGC GAGTTGTGCA TCCTGTTCCA GAGGGCTTTG 2340 

CGCACGCGTG CgcACGCATG CCCCCTTGCT CTGGAGCAGC ACTCGGATTT GATCGCCTGG 2400 

TTGCGCTGCT AGCCGGTCGG CACTCATTAG ATGCGTTTGT GTATGATCAG TGACACTCCT 2460

CCTGCCTTGG AGAAGTTAAT TGGAAGTTTC CTGGTTGTAT TCGATGAGCG TTCTCACGGG 2520 

AAGATCCCCA ATCAGCTCAT GGTACCGTAG aATGGTAAAC CCACAACGGC GAAGAAGCCC 2580 

ACCACTTCTG CCCCGCCAGC CCGGAGCATC GTGCGCGCTG CATTCAGCGT TCCACCGGTG 2640

GCAATCAGGT CGTCTGTTAA CAGCACGCGG GCCCCCGCGA CTACATCGCT CTTGTGAACC 2700 

TCAACGGTCG CCTTTCCATA CTCTAAGGAA TAGGAGCACG AGTACGTATC CCCCGGTAGT 2760

TTCCCCGCCT TCCGAACTAA AATAAGAGGT ATTCCCATGC GATCTGCAAA AGGCGCGGCA 2820

AAAATAAAGC CACGTGATTC GATTGCTGCG ACCGCGGTAA CGTGCTCATC GCGGTAGAAT 2880

TCCACCATTT GATCAAGACA GTAACGAAAT ACAGCCGCGT TCATCAGCAC GCCAGTAATG 2940

TCGTAGTAGA GAATTCCTTT TTTAGGGAAA TCAATCCGCT TACGAATTGC GCGGTCCAGC 3000 

GCCGCGTGTC CGTCCACAGG GGCATGGTAA CGTCCAATAC CACGCACGTC AATGATCTTA 3060 

CCGGTTTGTT GGGAGGCTTG GTGGATTGAG AATTACGTCT CCTGGAAAAA AGATTTCGCT 3120 

GAAACTTCAC GAAATCTCGG TGAAAATAAA TGATTATTTT ACCAATCGGT GAAAAAAAGC 3180 

CGGGAAAAGT CCAAAAAGAC AGTGGTTATG CTCCATTTCT TTCGATTTTT TGTTGGCATG 3240 

GTTTTTGCTT TAAAGTTTGG AGGAGAAAGA ACGATGAACA TGTGTACAGA TGGAAAAAAA 3300 

TACCACAGCA CCGCCACGAG CGCTGCAGTT GGAGCCAGCG CCCCCGGTGT ACCGGACGCT 3360 

CGTGCCATTG CTGCTATCTG CGAGCAATTG CGCCACATGn TAGCGGATCT GGGAGTACTG 3420 

TATATCAAGC TACATAACTA TCACTGGCAC ATCTACGGCA TTGAGTTTAA ACAGGTGCAT 3480 

GAGCTCCTTG AAGAGTATTA TGTATCAGTT ACTGAAGCCT TTGATACGAT TGCCGAGCGG 3540 

TTGTTACAGC TGGGCGCGCA GGCTCCTGCG TCTATGGCTG AATACCTTGC GTTGAGTGGA 3600
ATTGCAGAAG AGACGGAGAA
GATTTTGAAT ACCTAAGTAC 
GATGCAGTGA CTGACGGCAT 
ATGCTTGGTG CTACCCTGAA 
CCATGCGCTG GAAGTCCTGT 
GAGGGAGGTG TTGGTTGAAG 
GTTTTGTGGA AAAAATTGTT 
CCTGTGCTAA GTCCATACGA 
ATGCAGTCAC CCACGGTATC 
GCgTGCAGTG CACTAtACGC 
CTTTGGCTCC TCACTCATTG 
TGGAGCGAAg TATGTGTTCG 
TACGTATACT GCGAGTTGCC 
GGTTATTTGG GGATTAGCGT 
ACGGTGGCTG TCTCTCGTGA 
GCCGTTGCGG GAACGGCTCC 
CTACACGGTT GGTTGTGTAT 
GCATATGTTC GTCATCGGCG 
AATCCATAAG CCTCCTATGA 
CTGACGGAGC GaGCGAGTTC 
CCTCAACGCA TTGCATAACA 
aGGTGTGAGA CGTAATCCGA 
CGTGTTGTAA AAAAATGCCC 
GAGATCTACC AACGTTACCA 
TTCTACTGCA ATATCAGAAC 
AGGCGCGTCG TTTACGCCGT 
GGAAATTTCT CGTTCCTTAT 
GCGTGCAGCG ATGGTGTGTG 
CCCACGCTTG TGCAAGGCAC 
AGAGATCACT ATCGTCTCTG CkCTTGCGCG CGTAAAGCGG 3660

GCGATTCAGC CAAACGCAAG TACTTGCAGC TGAAAGTGGG 3720 

TATCACAGAC ATACTGAGGA CGTTGGGAAA GGCCATTTGG 3780

AGCCTAGGTA GAGCAGGCTG TACGTACAAC ACACGTACGG 3840

ATTTTGCACA TAAGGCCTCT CTCCCGTTAC AGCATGAGGG 3900 

TGCTtGGGGA AGTGTGCATA ATCGTCcTAC GGAAGGGGGC 3960

AACGCAGACG GATCGGATGC TATCTGTCCT GCGTCTGCGG 4020 

TCTTACCAGG AGAGCTATTC TCTTGGTGAG GAAATCGCAA 4080 

GGTGTCGGAC TATCCAwCtT GCACTGGTGC TCCTGGTGGT 4140 

CGGCTGACTT GACGGCTCGC TATGTTGTTG GTTTTAGTGT 4200 

TGCTGTACCT GTGCTCTACG CTGTACCATG CTCTGCCTCG 4260 

GTGTTATTGA TCACTGTTGT ATTTACGTGC TCATTGCAGG 4320

TGACTACACT GTACGGCGCG ATCGGATGGA CTGTTTTTGG 4380

GTAGTGGGAG CGTAATATAC TCCGTGTTTG GGCATCGGGT 4440 

TGTATATAGC GATGGGGTGG CTGGTAGTGT TTGTAGCAAA 4500

CTGAGATTAG CTTTCTGTTT TTGGTATtAG GAGGCGTGCT 4560 

TCTACGCACT CAAGAGAATA AAGTGGACGC ATACTATCTG 4620 

GTAGCGTCAT GCATTTTTTT TCGCTGTATT TAAGCTTTTA 4680

TAGATAGGAG GTTCGTTTCT TTGCGCAGAC CGCATCCTGT 4740 

GCGCAGTCCT TTATGGTGAT GAAGACTGAA ACTGGTTCAA 4800 

CCGAGACTGA GCTTAAACTC ATCGCTGCTG CTGCAAGCAT 4860 

AGAAGGGATA TCCGAGTCCT GCTGCTAGAG GAACGCCGAG 4920 

AAAATAAGTT CTGCTTCATG TTCCGCACCG TTGCAATGCT 4980 

CGTCCCGTAT GCAGTTTCTC ATCAGGACTA CGTCTGCACT 5040 

CTGCACCGAT GGCGATCCCA ACATCGGCGG ATGCCAGTGC 5100 

CTCCTACCAT CGCTACCATC ATTCCGGACG CTTTTAAAGC 5160

CATGAGGGAG TAACTCCGCT TTACTTTTCT TGACACCACA 5220 

CAACGTGTTT GACGTCTCCC GTTAGCATCA GCGTTTGGAT 5280 

CAATCGCTGC AGAAGAATGT ACCTTTACGG GATCTGAAAC 5340 
AAAAAGAACT CCTACGAGAT TTTTATCCGC TGCTACAAAT AAGGGCGTTT CCTCTAGATT
GTGTGATGGA GAGAGATATG TGTCCATGCC ATCAATACTG TGTGCGACCA TCATACGTGC 
ATTGCCTACC ATGACGGTCT TTGCATACGA GGTATGCACT AAGCGCGCCC GTAGACCGAG
TCCTTGTTCT GAGTTGAAAT CGGTTATAGC AAGCGGTGTC ATTCCTTTAC GCTGTGCAGc
TACGCTAATT GCAGCTGCAA GCGGATGGCC AGAGCATACT TCTAAGCTGT ACGcAAGGTG 
GAGTATGTCT TCTTCGTTAT AGGTTGGATG GAGCGTGTGT ATGTGTGAAA GTGTAGGACG 
TCCTAAGGTG AGGGTGCCAG TTTTATCGAA CGCTATTACT TTCGTGCGTG CCATTTGCTG 
GAATACCTGC GCTGATTTTA TGAGAATACC CATCTGTGCA CCCTTACCCG TTGCAACCAT 
GAGCGCGGTA GGGACGGCAA GTCCTAACAC GCACGGGCAT GATATGACCA GGACAGTGAC 
TGCGATAGAA AAGGCAAATT CTGCAGACGC TCCTGCGCAT AACCACGCGC ACCAGaGAGC 
AAGGAGAGTG CTACGATTGA TGGTACGAAT ATGcGCTGAC AGCGTCGACT AGTTTGGTGA
CCGGAACTTT AGACGCAGCA GTTTTTTCTA CCAATGAGAT AATTTGCGCA AGGGTGGTAT
GCTCCCCTAC CCGTTCAGCA CGAAATTTGA GGAACCCCGT GCTGACTAAG GACGCAGAAA 
TGACGGAATC TCCGCGTCCT TTTTCTACCG GaATACTTTC CCCTGTGaCG TTTGACTCAT
CGAGCGTGGC CTGCCCGGAT GTGATGATCC CATCTACCGG AACTAGCTCA CCTGCTTTTA 
CAAGTACGGT GTCTCCGACA AGTACGTCCT GTGcAGGAAT TTCTATCTCA ATTTCATGGG
TCTCATGGGC TGATGcAGCG CTTGCAGTTG TTGGGGAAGA AGGGGATGCT CCGCGCGGAA 
CAGATACcTG ACGGATAACG CGAGCCGTTT TAGGTTTTAT GTCTAGCAGT TGTGTGAGTG 
CGCGAGAAGT GCGCCCTTTA GACAAGGCGG ACAGGTATTT ACCCACCGTG ACGAGCGTTA 
CGATCATTGC AGCTGATTCG AAATACAAAT CCGCCACATA GTGCGATACA AGTGCCGTGT 
CGTTGGCATG CACGCCCATT GCTATACGCG CCGTGGCAAA GAGACCGTAT GTAAAAGAAC 
TCAGGGAACC GAGAGAGATG AGCGAATCCA TAGTTGCAGT GTTGCGTCTC AGAATTGCAC 
CATACAACGC AATAAGTCCT GCACGAAAAA GAGAGCGATT GGCGTACAGG ACAGGTAATG 
TCAGAAACGC CTGTACAAGG GCAAAGGAAA GCGCATATTT CAGGGGGTGC AAGAACCCAG 
GGATCGGTAG GTGCACCATG TGCCCCATGG ACAGATACAT AAGGGGCACG AGTAAGCAGA 
GAGAAGTACG GACACGCCTT TTGAGCGTCA CAAAATCTGG ATGTACCGGC TGGGTTGCAG 
CAAGCGGTGC GGtTGTCGAA TGCGTATCTA AAAGCGTGGC TTTGAATCCT GCATGTGAAA 
CTGCATCGAT GATGGTCTGA GCAAACAGGG TGTGCTCAGT AGGGTGAAGA TCAGTGTGTA 
CGTATAAATG GCTGGTGGTG GGATTTACGT AAACGTCGTA TGCGCCTGTC ACGTGGCGCA 
CTTGCATGAA AGACTACCTC
GAGGAGCGGC TGGCAGTGAC 
ATAGTGCTGC CTCCTGTCTA
GAGCAGCGGC AGGCATTTAG 
TGCGATGGGC ATAGCAGACA 
CATGCAGGTA TTGCCACGCA
AAGCGTGAGG TGACGCGCGA 
CAAGACACTA TTCGCATGCA 
CGCTTTACGC TTGATGCAGG 
AACGTGGCTT GCTCTATCGT 
CGCTGTCTGA CGATGAGGTT 
ACCCTCTTTT ACCCCGTACT 
CTCAGGTGGG GGAAACTATC 
GCGTGCCAAT GGTGAACCGC
TTGGAACCGG TATGGTAAAG 
CGCGCCATTC GCTTGAAgCG 
TGCCTGCTGC GTATCGGGGG 
TGCAGGCGCA TGGGCTCCTG 
ATCGCTGCGA AGCAGTTATT 
AATGGGAAAA TACCTATGTG
TGAGTCGGGT GGATGTGCAG
CTGACGTGTT AGATACGTGG 
ACGCACGCAG nanAgnATAT
CTATTCAGGA CGGGTTTTTT 
GGCAAGAAAC TTGCATGTAC 
AATGGGAGCT CAACGAATTA 
TGGGCCTGCA GATTTTGAAG 
CCCGCGTGCG CGCGGgAGTG 
GATAGAAGGG GGTGCGCGTA 
ACTCCATATG GGGCACTGTC 
CATGGCCGGT GCGTGTACGC 
GCATGTGGTT GAACGCGCCT 
ACAATTCGTT GCACGAACGC 
GTTACGGAAG ATGGGGGCAT 
TATGTCAGCC TCCGTACGCG 
AGCATGTACT TGGTTAACTG 
TTTCATCAAG AAAAGGATGG 
GAAGAAGAAG GAAACGGCGT 
ATCATTGCTA CTACGCGCCC 
GATGATGCGC GCTACCAATC 
ATTGTTCCTA TTATTGCTGA 
ATTACTCCTG CGCACGATCC 
ATTAATATGC TCAATCCAGA 
CTTTCGTGTG CTCAGGCACG 
TCCCGTGAGG AGCGCATAGT 
GAGCCGTATC TTTCTCTGCA 
GCTGCGTGGA AGCGTGCGGA 
CGGTGGCTTG AGCACATTCG 
ATCCCGGTGT GGTATTGCGC 
CGCTGTGCTC ATTGCGGCAG 
TTTTCCAGTT GGCTGTGGCC 
ACCGTGAACA ACAAATGACA 7140

ATGTATCCAA AAGCTCTGGG 7200 

CGGATAAAAA ACCGTACACT 7260 

TGACCCAAAA ACTGCAAAAA 7320 

CGCGTGTCTA CGCATGCTGG 7380 

GAACGTCGGA TAGCGAGGGG 7440 

CCTTTGTCAT TGCTATCCCA 7500 

TCAATACGGT GTTGCAGGAT 7560

TCTGGATTCC GGGAACTGAC 7620

TGAGGAAGGA AGGCATCCAT 7680 

AGCAGATAAA GGATTCCCAT 7740 

CTTGTGATTG GACCTGTGAG 7800

AAGntTCGTT ACGCTTTATG 7860

GTGTCCTCGC TGTGGCACCG 7920 

CGCGCTCTAT TATGTTCGGT 7980 

TCCCCCTCCA TTAGGGACTG 8040 

TGAAACCATT TTGGCAGATG 8100 

TTTGATTGGA CGTAAGGTAT 8160 

TTCATATGTT GCGCAGGATT 8220 

GAACGACTGG GATATTGGGA 8280 

TGGCTCGCTC AATGATCAGG 8340 

GATACAAATC GTTGCCGATT 8400 

GCATTCGGTG GGAGTGTGTT 8460 

GTGGTTTGTC AAAATGAAAC 8520 

CGTGCAGTTC CATCCTAAGA 8580 

CGACTGGTGT ATTTCGCGCC 8640 

ACAGTGTGCA CAGCAAACGG 8700 

TGCGGATATA ACGCAGGATC 8760 

TTTTTCTACT CTTGGGTGGC 8820 
CTCAGGAAAC GCAGAArctG CGCGCGTTTT ACCCCACGTC TGCGGTCATT ACCGCGTATG
ACATTATTTT CTTTTGGGTG GCGCGCATGA TAATGGCGGG GCTGGAGTTT ACGCAAACGG 
TTCCTTTTCG AGATGTGTAC CTGcACGGTT TAGTGCGTGA CAAGCAGGGA AGAAAGATGA 
GCAAATCACT CAACAACGGG GTGGACCCGC TGCACATTAT TCGCACGTAC GGTGCCGAtG 
cAtGCGTTTT ACGCTTGCCt TTATGTGTGC gCAGGGGCAG GACGTGTTGA TAGAAATGGA 
TTCGTTCAAG ATGGGTTCGC GGTTTGCGAA TAAGGTGTGG AATGCTTCTC GTTATATTTT 
GGGCAATCTC GAAGGCAGGC GGGTGTACGC TATTGCGCAC GTGTCTCTAA CTGAACTGGA 
TCGCTGGATC TTTCACACAT TTAATGAAAC TGTGCAGCAG GTGCGTACAG CACTTGAAGC 
GTACCGTTTT AATGATGCGG CACAGGCAGT GTATGAGTTC TTTTGGAACA GCTTTTGTGA
TTGGTATGTA GAGGCAAGTA AATGCTCGTT TCAGAAACCT GATGAACAGG AGAAGGATCG 
CGCAgCTTcA GTGCTCTGTA CCCTTCTGGA AGAGACGCTG CGACTGCTCC ATCCTTTTTT 
GCCGTTTGTA ACAGAAGAGA TTTACCGGTC cTGTCGCCTT CTGTGCACGA TACCACCCAA 
GCAATTCCGT CTGGGGCGCA CGCGTTGCTC ATGTGCGCGC CATATCCGGT GTATGTGCCG 
TCGCGGGTAG ATGCGCGCGC GTGTGCGCAT ATAGGTGCGG TGCAGGAAAT AGTGCGTGCG 
GTGCGnTACT GCGCGCTGCG TGTGGTATTG ATCCGCAAAA AGCTGTTTCA GTCAGAcTGC 
GTCCGAGTTC TCCGGCGCAG GATGCGAACG CCGCAGCGCA GGTGTCCTGT GTGCACGATC 
CGGGAGCGGT GGCGCGCACA TATGAGGAAT TGATTTGTGT GTTAGCGGGT ATTTCCTCGC 
TTGTGTATCT TGAAAGCGAT GCGCCTAAAC CGCAG t TGCC GTTGCAACAG CGGGGACAGG 
GTTTGAGCTG TTCTTAGTAA CGACGGAAGG AATTGACCGG ACGATGCTGT GCGCGCGTCT 
TCAAAAAGCG TGGCAGAAGG CGCGGCAAAA AGTGCAGCAG GTGGAGCGTA AgcTTGCAGA 
CGCGCAgTTT TGCACGCACG CTCCTGAAGA AGTGGTGaCC GC AGAGCGC A AGAAACTGGC 
AGAGGCGCGC GCAACGTGCC ACACCCTTGC AGGATATCTT GCGGACATGA ATGGAAAGCC 
TGGACCGCTC TCTGACTCCG ATTAGGGTCC TGTGCCCCTG AGCAATCCGT TTAGCAGCAC 
GAACAGCCCA TAT AC CGCGC ACAGGAGCAC ACCGGCAGGg CGGGTGAGGG TCCTGCGGCC 
GAGTGCGCAC GCGTGAAAGA TTCCCACGAC TAACAGCATG GCAGGCAGGT GTAGCAGAGA 
AAAGATTTTT GGCACCGGCA GGCCGTGTGG CGTAAGAGAC GCGGCAGCTC CGACTACAAA 
CAGCACATTG AGGATATCCG CACCTACTAT GTTTCCCACT GCCAgTGCGC CGTGTCCGCG 
GCGTACTGCG GTGATGGCAG AGACGAGTTC TGGCACGCTG GTGCC AAAGG CGATGATGGT 
TGCGGCTATG ATGCCTGCAG GTACTCCTGC GCGGAGCcmA TGATTTCTAC CGTGGGGATG 
AGGACGCGCG AACCGAGGAC GAGGAACCCG ATTCCCCCTC CTAATTGCAG GAGCAGGCGG 10620

CATACAcTGC GCGTATCAGT CTTGTCTGCG GGGAGCGCTG CTGTGCGGGT GTCTGGAGCG 10680 

TGTGGGAGGG CCGACCAGCG CAGAGAAACC CACAGGTACA GCGCGAGCAG ACTGAGAAAC 10740 

AGCCAGCCGA CGTACTGATG CACCCGCGCG CCAAAGCGTG GCAGGGTTAC CCATCCGAGC 10800 

GCGCATACGA CGAACAATTG CACCCGCGCG TGCCGGCGCA TCAAGTGTGT GTCGAGCGCG 10860 

AGCCCGGGGC GTGCAAGGAG TGCCCCGAGT CCGAGAATGA AACCGGTATC CACCACGATG 10920 

GATCCTATGG CGTTTCCGAG TGCTAAGTCG GCGTTGCCsC AGAGCGCAGC GwATACAGAC 10980 

ACGGCTGCCt CGGGGGTGGT GGTGCCCAGG CTCACGAGCG TGGCGCCCAG GAGCGCTTCG 11040 

CTGATCCCCC AACGCCGGGA AAGCGCgCTG GCGCTCTCTA CCAAGCAGTC TGCGCTGCGG 11100 

GCCAGAAAGT AGAGCGCACA GAGCAAGACG CCGAGTAAGG TGGGGAGTGT GCGCGcCGCA 11160 

AGTGCGCTGC GTACAAACGA TTCCATAtGC GTACTGGAAC GGTATCACAC TGGGGGGAGA 11220 

ATGGACAGCa GGGAGGAAAA CGCTCATAAT GACCGCACGT GAAGTGGTCG CTCGTTCTTT 11280 

CAGGTGGTGG TGCGCGGGGA ATTGCCCACA TTGGGGTGCT CAAGGCGCTT GAAGCGCTAC 11340 

AGGTTCCGCC GCCGCAATGT GTCGTAGGAT GTTCTATGGG TGmGsTGGTG GGGGCGCTCT 11400 

ATGCGCTGGG GATGTCGGTG CGGGAGATGG AGGCGTTTTT TCAGCGTGAT TTTGTTATTT 11460 

CAGACTATGT GAATGCACGG GATCCCTCTG CGTGCGTTGA GGCGGGGAGT CnATnnGCCA 11520 

GCAAAAGGCC AGGAACCGTA AAAAGGTCGC GTTGCTGGCG TTTTTCCATA GTCnGGCCCC 11580 

CTGACGAGCA TCACAAAAAT CGACGCTCAA GTCAGAGGTG GCGAAACC 11628 
(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15518 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 

ATCGTGGAGG CAGTGGATAA AATAATTCAG CCAGTTTTTT TGTCTGCACT TACCACCTTC 60 

GTTGGTTTTG TATCTTTTTG TTTTACCTCT GTTGTGCCTA TTTTTGAGTT CGGCGTGTTC 120 

GCAAGCGTGG GCGTGGCGTC TGCGTTTGCA TGGCGCTCAT GCTTATCCCC TCGCTCCTCA 180 

TTATCCGTGG GCCTGAATCG CGTGTGTGTG CGCATGCTCC CGATGCCGGT CATGAACACA 240 

TGGATACGGC GATCACCGGT ACGCTGATGG TAATCGCCCA TCACTATCGG ACGGTGTTGT 300 
TTGTTGCATT CCTTGCTGTT GTATTTTCCC TGGTGGGGAT GTCACGTTTG GTAATTGACA 360

ACGTGCTTAC GGAATACTTT GAGCCGGAGn TAACAGTGGT GCaGTCTGAT CGCTTTATGC 420

AGCAGCAcTT CGGTGGTTCT CGATCGCTCA CCGTATTAGT GAGTACCCcT GCGCGGGATG 480 

GCAGTGTAGC ACGTCCGGAT GTACTGAAGG CTATGGATGA TCTGACTGAG TTTTTACAAA 540

CGCGGGTGGA GCATGTGGGA AAGGTTATTT CTCTCGTCCC GCTTATCAAG CGCATTAACC 600 

AAGTGTACAA CGCAGACgCG TCGGCGCGAG GCCTGGAGGC GCAGTCTGCA GATGTGGTGC 660 

GCGGTGGTAC GGATGACTTT GGTGTTTTTA AAACATTCAC GGGCGGACAT GAGGAACCTG 720 

CGCGGGCGGA GACGTCACGT ACTTCCTTGG CGGCGCCGGG GTCATCGTAT GATTTTCGTC 780

AAGCAGTCGG TATGCTGGTA AGTGCCGTGC GGGATTCTGA TTTTGATCGT TCAGATGCGC 840 

AgCAGCTcGT GCAGGCTCTT GAGAAGGCGG TGAACTACGA TGGGCGCGCG TATTATGAGA 900 

TACCGTGTGA TCCTAAGAAA TATGGGGTGA AAACGAGCGA GGAATTGCAG GAAATTATCA 960 

GTGGGTATTT GTTACTGCTT tCAGGAAAAG GGTTGGGTCT GGTGGATCGT GCCGTAGACC 1020 

CCCGTGCGTT AAAGATGAAC ATCCAGCTCG GAACTAAGGG TCAGCAAGAC TCATACGGTG 1080 

TCATTGAGGC AGTAAAAAAG TTTATCCGGG AAAATTTTCC TCAAGACGTG CACGCTGAGT 1140 

TTGGCGGCTC AGTATTGGTT GAGCAATCCT TGAATGATCT GGTGGTACAA TCTCAGCTGA 1200 

TTTCACTGGT TTTtTCTTTG TGTGTAGTTT TTATCATCAT CGCAGTACAT TACCGCTCGC 1260 

TGTTTGCTGG TATAATCGGT ACCCTTCCTT TAGGAGTATC TGTGTTGGTG AACTTTGGGG 1320 

TTATGGGATT TTTtGGCATT AAGCTGAACA TTTGCACCAC GATGGTGgCA GGCTTTTCAA 1380 

GCGGTATTGG GGTCGACTAT ACGATACACT ATCTGGCGGC GTATCGGCGC GCGTGGAAGG 1440

AGTGTGGTGG AAAAGATTTT CTGACACAAA CATTCTATGG TTCAGGGCGG GCAATTCTTT 1500 

TTAATGTTCT GTCTGTAGGA TCGGGATTTG CAGTGCTGAT GCTTTCAAAG TTCAATGTTC 1560 

TTGCTGATTT TGGTTTGCTT ATGGTGTTGG CTATGCTTAC AAGTTCAGTG GCGAGTCTCA 1620 

CGCTCCTTCC TACCTTACTG AATGTGGTCA AACCAAGGTT CATCACACGA TAGAACCAAA 1680 

GGGAGGTATG CATGAAACGG ATAGCATATG TGCCGTTGTG CGCGGTAGTT GGTGGCATGT 1740 

GTTCGATGTG GGCACAGAGT GCAACAGATG TGATGGGTAG CTTTAAGAAA ACGGCGGAAA 1800 

CAGGCACAAT GGGTACGCAA GCCCGCATGG TTGTCCGGAA GGCGGGTAAG ACGGTGAGTA 1860 

CCTTAGTACT TAAACAGTAT ACCCGGTATG AAAAGAGTGG AGAGCAAAAG ACTCTTATAG 1920 

AGTTTTTGTC TCCGTTGAGC GTGAGGGGAA CACGCTTCTT ATCCCTGCAG AAAAAGGACG 1980 

GGGCGTGGGA GCAGTACCTC TATTTGCCCA AACTCGCACG CGTCAGGAGC ATTACAGGGG 2040 
GGGATGCCCA CGCTTCGTTT ATGGGGACGG ATTTTTCGTA TCACGATCTT TCGCTTGTTG
GTGGGGTTGC TGATCTTGAT GAATGTACGC TCGACGGTAC GGAGTCG T AC GGGGGAAAGA 
TGTGCGTGCG CATTCAGACA CTGTC ACAC A AGCCCCAGGC GCGGTACGTC AGGGCGTTGC 
TGTGGATAGA GCAGGAAACA GGTCGTTTTG TGAAAGGGGA ATTTTTCGAT AAAAAAGACA 
AGCGCGTGAA GATCATGACG CTTTCTGATT ACGAGACTAT CCAGGGTGTA GATACACCAA 
AGACGGTTGT GCTCGAGACG ATCGCCCAAC G CAT AC T AC A ACCATTCACC TCACGAAGgT 
TGAGTATCAC ATGGACATCC CTGAGAAGGT GTTTACCCCT GAGTATCTAA CCCAAACCGA 
TCGGTGAGTG TTGTGGCTTT TAGCGTGTTG TTTCTTCGTG CGGTATGGCT CGGTGGCTGG 
CGGTCGAGCC CTTCTTGAGC GTCTCGAGCG TCGAGGCGCC ATCCTGCCGT ATTGCGTTTA 
GGAAGTCGAT GACGCTTGGG TAGCGCTGAA AAAACTC AC C CCACTTTGTG AGGCGTTGCA 
TTTCAAAGTT CTCAAGTgCg nTGCGCTTTG CCGCTTGtGC GCGAGTGCAC TGAGGTGTTC 
GGAACCcTTC GCTTCGAACG CGCGGTAGGC cTGCTCTGCA GTGTGGTACA GGACCATGTC 
TGGTATGTGT ACCTCGGAAA GCACCACCTC TGAAATTGCG CAGTGGGGAA TTTCTTTTTC 
GATTGCGCGT TTCAGTTCTC GTGTGGCAAG CCCGTACTGC GTGTGAACGC GCTCGTAGAG 
GGCTGGGTCG GCCATGCAGC GCGCGATGAA ATCGTGCGAC ACCTGGGTGA GTGCCGCTTG 
TGCTGTAGTG TCCACGTATC GCTCGAGCGA GTTTTGGTCA GTGATCCTTT CTCGCTGCAC 
GGTATCCAAC AGGAAAGCTT CTTTGAGAGC AACGCGCGCC GAGAGCGCTA AAGTCCAGTC 
AAAGGGTGCG TGTTGATCAA GCcAnTGCGC GTACTCCTTT GCCGCGGGCA GGACGTCCTG 
AACGGTGACG GAGACCGTCT GCTTCTTCAA CTCAAAGGCA AAGAG TCCGC ATTGGACGGG 
AGCAGCGGCT CCCAGCGCCA GAGAAATCGC CCCGGAGCGA TGAGCGCGTG GTGGTAACCA 
CCTGAGCGTG ATCGCATAAC CCCATAACTT CCCGCCGGCA GGGAGAGCTG AAGCCATCCC 
CTCCACAGAA GGTACGTACT CGCGCCCAGG GCACAGACAA GCGAAAAACA AAACGTCCGC 
ACACGCATGA GTAGGGGAGC CTAGCAGTTT GTGTATCCCA ACGCAAGAGT ATTTGGGCGC 
GCAgTATATG GTGGTATCGT GCAGTCCTTT TGCACTGTCC ATTGC TGAG A CATACCGTGG 
TAATTGAATG GGCAGCGCCc TCTATGGTAG GGTCCGCCCC TATGGGTCGA GACTCGCCGG 
CGGGTATGCG CGAGGCCGTT TACTTTCTGC ACCGGATGGT GGTGTGTCTG GGCGTGCTGC 
TGTGTGCAGC GTCGCTACTT TATGTGTTTG GGAACTTTTC TCACTTTCTT GATAAAAGCC 
AGTTTATTAT TTTACGTTCA TGTGTCGGCT GTTCAGTACT GTTAGTGGTT GCCTGTTTGT 
GTGCGGGCAG TTTTGAGCTC TACTTTTTTT TGACGCGTAg TGACGCCCCG TATGGGCGGC 
TGCTGTGTAT CACCGTCGTG GCACTGCTTT TTGGTATGGG TGCACTTGTT TTCAATACGG
TAGTGCTCAT CGTGGCTAAA GGCACATGAG AGATTTACGA GGCACATCTG CATCTTTTAC 
TTCACACGTT GTAGTTTACA GCTGGAACTG AGAGCCGAGG TACACTTGGC GTACGTGAGA 
GGATCGTACT ACCTCTTGTG GACTGCCCTG GGCGATAATA TGGCCGCAGT GTATGATATA 
GGCTCGGTCA GTGATTTGTA GTGTTTCACG TACGTTGTGG TCCGTGATGA GTATGCCAAC 
GCCTGAGTGC GCAAGACGCA CGATAATGCG CTTGATATCC TGCACGGCGC AAGGGTCGAT 
CCCGGAGAAG GGTTCATCAA AAATTAAGAA GCGTGGATTT ACCGTTAATG CACGCGCAAT 
TTCCACGCGT TTGCGCTCGC CACCTGAAAG CGTGTCTGCC CTTTGATTTC GCACATGGGT 
CAGCTGAAAT GCTTTGAGCA GCGCTTCGCA TCGCTCGGTT TGTTCTGTGT AACTCAGATC 
GCGGCGCATT TGCATGATTG CGCGCACGTT TGCTTCTACC GTTATTTTTC TAAAAATAGA 
CGGTTCTTGC GGTACGTAGG ACACGCCCAT GCGCGCGCGC ACATGTATGG GTAGCGGCGT 
TATGTCTGTG CAGTCTAGCA GGACGCGCCC GCTATCTGGA CGGCACAGAC CCATTACCAT 
ACTGAACGAT ACTGATTTGC CTGCTCCATT GGGACCGAAC AGCCCAACTA TCTCTGCTTG 
GTGTACAGAA AAGGAGACGT CATGCwmCAC GTGCCGTGTT CTAAATGTTT TATTGAGTGC 
AGCGGCCACG AGGCGCTTTT CTCCCGGTGG AGATTCATTC CAGGAAATAT TTTTTTCTTT 
CTCTGTCTCT GCCTTTAAAG TATATGTGAC AwygTGCTTT GCACGGAGTC TGCGCGCGGC 
AGACAGGAAG CGTTGAAATG TCACCGGTGT TGTTCCCATT CGTAGGGTTT GAGAATTGAG 
CCTTGCACTT TTCCGTGTAG CTGAATTTCT CTACGGGCCA TATTCAACGT GATACGTTCT 
GCCTGAAAAA GATTACCGCG GTCGCTAACC CGTGGCGCAC CGCTTAATTC AAGGAATATA 
CTTTTGCGGT AATACATACC GAACATGGCC TGACATTGCA GGTCTTGGTA GGTAAGGGAC 
ACGTTCATTT GTAAGAGCAA TGTTTCGTTT TTTTCGTGGT AGACTATGCG CTCTGCACGT 
GCCATTACGT TTTCTCTTTT GTCAGAAATC TGCGCAGCAC CGAGTAGTTC GGTAGTATGC 
GCATTTCTGC TAAAATGCAG AGATTGTGCA GAGAACGTAA GGTGCTGTGT TTTTTTTTCT 
CCGCGCACGT TGCCGGTGGC GGTAACAAGA TGATAGTCGT CGCCGGAAAT TTCAATCTTA 
TCTGCGTGGA TTTCTAAATC GGCAAAGTAC ATGcGC gCGT TTCCGCGGAG TACAGTGCGT 
GGGGCGTGGG GGctCGGCAG AGCCTTC TAG ACTGTCTGCA AAAAAGCGCA CCTTACCTCG 
CGCGTGTGCG yGTTGCCACA CGCACAGGGA CATGAGAAAA AGTGCAAGGA GTACAGTGCG 
CAAGAAGGGT ACGGGTGAAA AAGCAGGGTA TGAGGACGCG CGAGTCATGG ACTGAGATTC 
GGTGCGGCGT ACCGTGACGC GCGGGTTTTG GCGAAAGAGT CAAGATCGAC GTCTATGTGA 
ACGCCATTTT GAAACACAAA GCGTTTGGTG
ACAACAGCCC CTGATGCATC GCTGATGCGA 
GCTCGCGTAT TTTCCCATCG AAATGCGCGT 
AAGCAGTCCA CCTTTTTCCC GAGGTAGAAA 
CCTGCATGCC CGCGCACGCT CGCACGCCCA 
CGTGCAGTCC AGGTTTGGTC GTGGTGATAG 
TCCAACAGGT GAGTGTTGTA GCGATCGAAC 
TTGCGCGTGG CGGAGGGTTG CTGTATC C AG 
AGAAGGAGGC AGGAGACAGG AGCAAAGAGG 
TACAACGGGC GTAgCcGGAC AGATGCCCGG 
AAGTAGGAAA GAAGGACGCA CCTTAAAAGC 
AACAAACCTC GCAAGAAGGG CGAACGAACT 
CCCCTATGAA GATTAAAGAG AAAAAAGGCT 
TTGCCTATAT GTTCGTAGCA GCCGTCCCCC 
GGGCACGTGA CCTTGCATCT GAATTGCACG 
CGCTGaCACA GTGCAGACCC TCCAACCTTT 
CGATGAGGGG TCGGTTGTGT TTGCCACGCG 
CGCATGGGCG GTGTATCCTG AGCATGCAGT 
ACACCTTGCA GAAATTGCTG AGCCAGGCTT 
CTTTTCCCCA GGGGGAAATG CTGTTTCCtC 
GTGTGTTGCA CACGGCGCCT ATAACCGCnT 
GGTTTTCTGA TGGGAAGGTG ATGGTTGTAm 
ATCCGGGCGG GAGCACATAT GAAATTGTGT 
TTGCTGCGTG CGTGTGTGGT TTGGACAGGC 
TGCAGTGCAA GATTGTTCAC CATCAATATT 
TGAATTTTGA TACCGAAGGG CGCTATGTGG 
TTGAtTGCCa AAGGTTAGAG ACAAACATTA 
GCGTGCAGCC TGAGTGCGAT GTTGTGACGG 
TTGCTGTTTT TGAGCGCGCG GTGCATAGGG 
CGTGCATTGA CGCATAAACC
ACAGGGGAGA AGTGATCGCT 
CCTTCCAAAC GCAGACCCTC 
ACAGTACCTT CGCAGTCAGT 
TTGCCGTCGT AGCGTGTAAA 
AACTCAAGCG TTTGAGCATG 
GTCACTTGGA AAAACGCGAT 
CGTAAGGCAC TGCTGTCGGC 
AGTCGAATGC GCACTACCGC 
GCAGAAAAAG AAGGACAGAC 
AACAACCCCG CCCCCACAGC 
TGAAAGAGAT CGCGCGCCAT 
ATTTCATCTC TTTTTCCGCT 
TCGGGGCTGA CCCTTACTTT 
AAkAGCGTCC TGAGCGCGCG 
CATGGTGGGG GAGTACTTTG 
GGTTACCCAG CGCCTTTCTG 
GCGCACGCCT GTTTTTAACC 
TGTGCATATT GAAgCGGATC 
CTATGACGCG CGCGGTGTAC 
TTCACTCTTC TGCTGCAGGC 
CtGCCGACGG CACCGTCAGA 
TTGGGGTGAC TCTCTCTGCA 
AGCGCGTTAT CCTGGTGTCT 
TGGAGGGCGC GTTACGTCAC 
TATTTGAACA TGCACAAGGG 
TCCCCCTGGT TGGGGATGTT 
TGTTAAGCCA GAAGGAGCAG 
TGGGGGATGT GCGGTTTGAC 
AACCCCGGTG 5580

TGTGAACAAT 5640

ATGGGGAAAG 5700 

GAGCAAAACA 5760

GTGGATGTCC 5820 

CAGGCGCGTT 5880 

GGTAGGTGTG 5940 

GCAgCAAACC 6000 

TCCTTTACGC 6060 

AACGCGTCCC 6120 

CGCGAAGCAC 6180 

TGTACGTTCT 6240 

CTATTTTTGA 6300

TTGCCTATTT 6360 

GTGCsTGAwA 6420 

GCTATTTTAC 6480 

CTTCTACACA 6540 

CTGCTGGGGA 6600 

GCTTTTTTCT 6660 

AACGGTGGnC 6720 

GCGGTTATCG 6780 

TGTGCATTCT 6840 

GATGGCACAC 6900 

CTTGCGGATG 6960 

CAGCTTTTGA 7020 

GTAGGGGTGA 7080 

GTTGGTATGG 7140 

CGGTGTCGGT 7200 

GCACAGGATG 7260
TGTCATTGAC TCAGGGTGAA AAAAAATTCT TTCTAAGTAT CGATATGCTT CTTGCACGCA
TTGACATTGC AGGGATCCCG TAAGGGGTAG GGACAGAGGG GGTGTGCCGG CGTGCGTACG 
ATTTTTGTGG GTGTACTGTT GCTCGCGATT ATGGGAGAAG GGCGCTTGTG TGCGTTGGAA 
TGGCCTGTTG ATAAACCTAA GTTTTTGTCT CTTTTTGGAC AGAGTGTGGG CGCAGGTCTG 
TTACAGCAGG GATTGATTTT TGATGGAGCA GACTCTGCCA GGAGAGCGTG GATACGCGGT 
ACGTACTGCG GGGTaCGGAC GyTGCGTGAT GCGACTTCAA AAACATCGCC GTGCGCGTGT 
CTTTCCCGGC GCGTTAGGAA ATGCGTTGAT TTTTGCGCAT GAAGACGGGT TACAGACGGT 
ATATGCAAAT TTAnCGAAGC GAAAAACGCG CAGGATTTTG GTTCTACCGC GGAAGCAGAA 
TCCGGGGTAA CGGTCGGATA CGCAGGATCA AGTGCGTGGG CACCTCCAAA CAGTTTTGTG 
TTCCAGGTGA TTGATACAAA AAACAAAGTG TATCTCAATC CCTTGCTCCT GACTGnCTTC 
GGTGTCGGAC ACCATAAAGC CCACCATTCA GGATGTGGTA TTGGCGGGAA AG AC AGGGG T 
GTTGGCTCTT TCGGGGACAG CAGCGCCGCG CGATGCCGAC GGGTATGTCT ATACACGCAA 
GCGCACCCGT GTGCACAGGC GCGTTACGCA GGGaACCTAT CGTCTGTATG CGGCAGTCGC 
AGATGTGTTA GAGCATGGTA CCCAGACGTT CACTCCGTTC CAAGTGCATG TTGTGGTGAA 
CGGATCGGAA GTGAGCGCGG TGTCCTTCGA GTTGATTGTG GCGAAAGATT CGCAGGCGTG 
TCTGTCAGGG TCGCTTTTAA ATGAACGCCT GTTATATGAG ATGAAGGGTC GCGTGTTTTT 
GGGGAGCGTA GTGCTCACGC GTGGTACTGC AGAGCTTGCG ATTAGCGCGC GTGATATTTC 
AGGCAATGAA CGAACGGAAG TGTTCTTTTT ACAGGTGGAG TAAGGCGTTC GTAGTTTTTT 
CATTTGTACA CCGGGGTGTG TGAGGGGGTG TGGTGTGGAT AGGACGGGTG GATACGTGCG 
GCTTGCGCTT GCAGCCCcTG CGGTGCGTGT TGCGGACTGT GCATACAATA CCCAGCGTAT 
GATTCAGACG GTGCGTCGTG CAGCTTCATG CGGTGTGGAC ATACTATTGT TTCCCCGTCT 
TTCGCTTACA GGGTGTAGCT GTGCGTCTCT TTTTGCTCAG GATACGCTGC TTTCGGCAGT 
CTGCACGCAC GTATCTGCAC TGTGTgcTGG CACTGCTGAT TGTCAGCTGT TAGCGCTTGT 
GAGTGTGCCC TGTTTTTTGC GCACTCAGGT GCcGTGTGTA CTGCGCTTGT CGCACGAGGT 
CGTGTTCTAG CACTGGTTGT GCAGGATACC CTGGCGGCGT GTGGCGCGCA AAAAATGCAA 
GTGCCCTGTG AGGTCCTGTA CGGTGGTGCA CCGGTGCCGG TGTACGATGT GCAGACGTGT 
TTTGAAAGTG CAGAGGGTCT TTTCTCTTTT TGTGTTGGTG CTATGGATGG ATCGGTACCT 
GCCACGCTGG TGTTGCAGGC CTACGGTACG CCAAGTACGG CGCAGACACC GGATATTTTT 
GCTGCGCACG CTGCGGCATA CAgTGCACAG CACCAATGTG CGTATGCGTA CGTAAATGCG 
GGGTGGGGGG AGTCTAGTGC TGATGCGGTG TATGGCGCGG AAAGTGGTAT TTTTGAGTGT
GGGCAGTGTG TGGTCCAAGA CTCATTGCAG GAGATGCGAG AACGGGGGGA GCGTCCGGCG 
CACGCGGTGC tGGaCTGCAT GTTAGTGCGG ACGTAGATGT GTCTTTGGTA CACTTTCGTC 
GTCGTGCGCG TAGcgGACcA TACCACTCTG GGTGCATCGG CTCCCTGCGT CACGCTTCCT 
GCaGGCATAT TTGCAGCGTC AAAGGCGCAC GCCACGCTGC GGCGTCCTCG CGTACCCTGT 
CCTTTTTTTC CGCCTGCTTT TCAAAAATCG CAGGATGCGg TGCCCCCGCT CACGGGTGCC 
GTGTGCCTCG CTGTTTCTGC ACCGTCAGAC ACGCAGGACG GTTTTTTGCA AAGAACGATA 
GACTTAGCCG CGCAGGgCGT GGCACTCCGT CTTGAACACA TGGGC TGTAG GCGCCTGGTG 
GTGGGTGTTT CAGGAGGTGT TGATTCGGCG TGTGCATTGC TAATATGCGC GCGCGCGTTA 
GATTTTCTCT CGATTGCGCG TACACAACTT TATGCGCTAA CGCTTCCTGG CTTTGGTACT 
ACGTCAGGAA CGAAAGGTGC GGCGCAGGAG TTTGCGCGTG CGCTCGGTTG CACTGTGCAA 
GAAATTTCTA TTAGCGCGGC AGTGACGCAT CATCTCCATG ATATTGGGCA TACGATGCAG 
CAGTGTGACG GTACtATGAG AATGCACAGG CGCGCGAACG GACGCAGATT TTGTTAGATC 
GTGCTAACCA GCTTGATGCG CTCATGATTG GTACGGGAGA TGCGTCAGAA GGTGCGCTTG 
GTTGGGAAAC CTTTGGGGGC GATCACCTTT CGCTGTACGC AgTGAACGCA TCTTTGCCCA 
AAACCGTGGT GCGAGCCTTG ATTTCCTATG CTGGGCGTGT ACCTGAGCGT TTTGTGTGTG 
AAACTGATTC TCCCTATGCA CCGCGCGGTG CTGCCTTTTC TCGCGTTTGT GCAGCTATAG 
TTGCACAGCC GGTGAGTCCT GAGCTCATAC CTCCTTGTGA TGATCGTATT GTGCAGTGTA 
CCGAGGAGAT GCTCGGTCCT TATGAATTGC ATGATTTTTT TCTGTATCAC AT AAC GGTG A 
ACGGTTTTGG TCCTCGAAAA CTTTTTCGTG TGGCCGCGCA TGCgTTTGGA ACTGCGTATT 
CTTGCGCGCA gcTATGTGCa GCgcTGCGCG TTTTTTTTAC CCGCTTGTTT TCACAGCAGT 
TCAAGCGTTC TTGTGTGCCT GATGGGCCCG GTCTTACGGA AGTGAACCTT TCCCCTCGTG 
TGGGTTTTTA TTTTCCCAGC GACACTTCCG GTGCGCTATG GCGCGCAGAG CTTGAGCAGC 
TGGcTTGTGG GGAATAGACT GGCACGCAGG ATTTTTAACA ACTGaTATGG AGGTGCGTAG 
GgCGTGGTGC ATACGCTTTT TTCTTGGGTT TCAGCGCATA TTCACTCGTT ACCTATGGTT 
GTGTTTGTCA GCCTGCTCTT GGCAGGAGTG CATGTGCCGG TTTCTGAAGA TGCGCTGATT 
GTCATGAGTG CATTAGTATG TCGACAGGAT GGAGCATCTG TGCCGAGCTT TCTAGGAGCG 
TTGTATGCAG GTGCATTAAT AAGTGATTAT GCGGTGTATT TTTGGGGATA CCTGTTGCAA 
CAGGGTGCGT TGCGTGTGGC TGCTCTTGAG CGGACGCTCG CGTCCTGCCG CGCACAAAAG 
ATAGTCACAC TTCTTTCGCG TTATGGCCTT TGGGTATATG TGCTTGCGCG TTTTGTCCCA
TTTGGGGTTC GTAATGTGGT TTCGCTGACG TCGGGGTTTG TGCGTGTGCC GTTTGTGCGT 
TTTGCGTGCT ACGACGCACT CGCAGCGGCC TGTAGTATTT CTGTGCTCTT TTGGATGACC 
TATTTCCTTG GCTCTGTACA GCGTATTTCA CTCAAG GTTT TTGCGGTGGT GATTTTGCCT 
TTGTCGGTGC TGGGTATACG GGTGTTGATT GGCCGCCGGC AGAAAACCAC AGGGGATGGA 
GTGAGAATTA CACACGATGA CGTACAAACT AATGTAGGAG TGAGGTGATG AGCACGTGTG 
CGCAGGCTTT TTATCGCTTG TATGAAATAA TTGTGCGGTT GCGTGCGCCG GACGGGTGTG 
CGTGGGATTt GGCACAAACG CCGGTAAGTA TGTGTTCGTC C TTTTTGG AG GAGACGTATG 
AAGCGC TTG A GGCTATCCTC GAAgAgGrCG AnGGCACAGC ATTCGTCGTA TGCTCACGTT 
CAGGAGGAGT TGGGGGACGT GCTGATGAAT GTGTGTATGA TTGCATACAT GTATGAACAG 
CGAGGGGTGT TCTCGCTTGC AGATGTTGTA ACTGCATTAA CGGAAAAGTT AATTCGACGT 
CACCCCCACG TATTTGGGCA AACAGAAGGA TTTCCTGGAC CGGAAAATCC GAAGCGAGCA 
CAAACAGCAC AGGAGGTGTT TGATCAGTGG GAACGGATTA AAACACAGGT GGAGCGTCGC 
CGTGCAGCTT CTCCGTTAGA GGGcATTCCT CGAACGGTTC CTCCCCTCAT GcGCGCGTCC 
AAAATGCAAA AAAACGCGTC GCTGnCGCGT CTTTTTTGTC CAACACGCAC GGAGGTGGTA 
C G AG AATGTG CGCGTACCTT TCGTGCACTC CGTGCGATGT CAGAGAATTC TGCCGAACAA 
TCCGCCACTc AAGCAGCGCA TGTTGCAGTA GGTGCGCTGT TGACTGCAGT GATATCGTTT 
GCACATCTTG TGGGGGTAGA TCCGGTGCTC GCCCTTATCC GCGCAAATGC GGACTTCGTG 
CGCCGCTTTT CGTGTGCCTG TTCTAtACCT GcCATTTCTG GAGGTACTTC TGTATTTTTG 
TCTCGCGCGT GCCATAAACC ACGTCGCGCA CGCACGCGGG CGTCTGCGGT GCGCAGGCGC 
GCACGGTcAC GGc GACTGTT TTTTACTCGA CACAAGCTGG GGAATATGCT ACGGTAGGAC 
GCGTCCCTGT CTCCGTGTGT AAATTGTTAG CACGGGCAGG GTGCGTGTTG AAGAAGAGGG 
GGCTTATGAA GACGTTGCAG TGTGATATTT GTCGGAAGGA AGTGGACAAT TCGCTGCCCG 
AGAGGTTGTA TTGGACATTC CGGGAGTATG ATGTGTGTGA GGACTGTAAG GAGTCTATTG 
AGGACAAGTT GCGCCCTATC ATACGTACTC ACCAGCCTTA TTCTCAGGGT TGGTACGAGA 
ATCAGTTCAT GGGTATGGTG CAGCGCGGGG TGTCTAACCG TCGTCCGTAA GTTTTTGATG 
TCAGTGTTTC GTGCTTGATG TGTGArGTAG GGACGTAnGG GTGTGATCCT TTTTTCTCGC 
GCGAGGTTGT GGGCGAGGGA TGGTGTCGCT CGCGCTTATG TTTCTTTCCT TGGGCCkCGG 
CGCTGTGTTT TTTGTGCgTC CCgGTGTAcT GGGAcGGTTC CTCTGTGCTG TTCGTGTGTG 
cAGGATCGGT TGTACGCGCG
CGCTGTGCCA AGCCGCTTGT 
GAATCGGGTG AAACGCCTGC 
GGGCGTC CTC TTATGTCGTT 
TCCCGCATTG CCGGGTGTTT 
ACCGAGTTGG TGCCAGTGCC 
GTTGAGGACG TGTCGCGTCG 
CGAGTAGAGG GTCGTTTCGC 
GCAGGGAGTA TAGAGCTCGG 
GACGTATTAC CACGTATGCC 
CGAGCGTGTG CAGGGTTTCT 
CTTTTAGAAA TTCCTGAAAA 
TTGTTTTTGC AGCATAGGTC 
CAGGTGTAGC GTCCGTGGAG 
GTGACCACAA GCAAGTCGCG 
CGTGGCGCTG TCCGCAGATA 
TCAGGACTAC GTTTGCCGTC 
CGCACGGTAC TTGGGCAGCG 
TGCTTTCGTG GGGTATAAAT 
CGCGAGCATT GCTTGGGGGG 
GCTTTTGTCT GTTGCCTGCG 
CCAGTGAAGT TCTCCTTGTG 
TACCCCCTTG CTGTCTAATA 
TGCAAGAGGT CATGGGGTGG 
GAAGTACGAA CTGCTGGACG 
TAAGCTTTTG CGGTACACTG 
AATGTTGAAA ATTGGTTGAA 
CGTTGACTTC AGGTTCATGA 
GAAATACACA CTTTTCAATT 
CGCACATGAC
TTCGG CGCGA 
GCTTTGGCGT 
GTGGAAGACA 
TTTGCGTACA 
GCCgCGGCCA 
ACTAGAATTG 
GCAGAAAACA 
GGCGCACGCT 
ACGATGGACG 
CGTTCTTTTT 
TGCCGCAGgA 
TCTTAGGCAG 
T AAAATC C AG 
TTCAACCGCG 
TGCGTATCAA 
TTGTGACCGA 
AAGCnTCGAT 
TAATCGTCCG 
TGTCTGCCAC 
CAGAAAGCAG 
GTTGCGGGTT 
GACGCATAAG 
AAAGGAGGGA 
CACTCAAGCA 
ATAAGCAAGA 
ATGACGATGA 
TGGGAGAACT 
TTATCCAAGC 
TTTTTGGAAC ACCCTGAGGA TTTCTGTAGT 12540

GCGTTGTGCG TCTCTTGCCG TGCGCTTCGA 12600 

GTCTTTTCAC TTTTGCCgTA CCTGGGTGTG 12660 

CAGCAGGAGC GGAATTTTGA TGCTCTTTTT 12720 

GCGCGTGAkC GCTccTTCGT CACTGCAAGT 12780

TGCAAGATGG CTGAGAGAGG ATGGGACCAG 12840 

GCTGGTTTTA CCGTTAATCG TGCGTTGGTG 12900 

TTGTCGCGCG CTGCGCTGnT TGAGAATCTT 12960 

CGTGTGCCGC GTGATGCCTT GATTATCGAT 13020 

CGTGTGCGCT GTGCTGCGCT CCTCGGGCAG 13080 

TGCGTGAGGC GTCAATTAGT TAGCGAACTT 13140 

CGTACGGCCC AGCAATTTTT CTCTTATATG 13200 

CACTCCGTgC ACCGTGGTCT TCGTGCGGTG 13260 

TGATGTGCAT GCATGCGAAA CTCACGCGGG 13320 

CGTGGGGTTC TTCCAGAGGA AAGCCCGATG 13380

CTGCGATTGT GGGTATGCCA AAACCCATGT 13440 

CCCCGGGTAG ACTCTCTAGG GCATGGGCGT 13500 

GAGTTCAGCA CTGAGTGCAA TGATTCGGCG 13560

TATGTAGGAG CATAGCCGTT CTTCCCCCAG 13620 

ATCAAACAGA GCAGCGGTCG CCTTGTTGAC 13680

TACTGCCACC AGGAGCGTAA AAGTATTGCG 13740 

TGCTGCGTGC AgcTGCTCAA AAACGGCGTG 13800 

GGTGCCAGTA AAGAGAGGTA GTTTAAAAAG 13860 

AATGAACGCA CAGGATTCAG AGAGTTTCCT 13920

TATGCACCTC GTGGTTCAGT TTTCGGATAT 13980 

CGAGCTTAGG AAAGCTTGTC TCCGACTTGG 14040 

TGGAATGCTT GCGAAGAAGT TCCATAACCT 14100 

GTATTTCTAG GCGCTTATTC TGGCGAGTGG 14160 

CTGCAAGGGT GATCAGTACG AGGAGCACGA 14220 
GCATCGCCCC TGCGCTGTAC AGCCtGCGCC GATAACGAGA CCGATCCCTG CTGTCACCCA
AATAGTAGTA GCAGTGGTTA AGCCTTTCAC GTTTGCACCC ATTTTTAAGA TGGCACCGCC 
GCCGAGAAAT CCCATGCCGG AAACCACCTG TGCAGCGATG CGCCCGGGGT CGCCGATGTG 
GTCTCCGGTA ATCTCACTTA CGCAGAGGGA CAGGAGCATA ACGCCCGTAG CACCGACACA 
GATGAGTGTG TGAGTGCGCA ATCCCGCTGC CTGTAACTTT GAGGAGCGCT CCAACCCGAT 
AGCAAGTCCT GaGACAAAGC TGaGCAAGAG CCGGaCAACA ATAACGGaAT CTGTAATCAT 
GACTTTTCTC TTAGGGcGTA gcAGGaTGCA AGTGCCTCGA GGGAGACTTG AACTCCCACG 
CCGGTGAAGG CACTAGCACC TGAAGCTAGC GTGTCTGCCA ATTCCACCAT CGAGGCAAGA 
AAACCCTTCC ATGGTGGGAA ATATAGTTTT TCTAGTCAAG GGATTAGAGC AGCTTTCAGG 
GCACGGGATG CAAAGGCGGC GTACTTGACA AAATGCCAAT TCCAATACAC GCTGcCCGcG 
GCgCTGCGCg TGGCGCCGTG GGC TATTAGC TCAGCTGGTA GAGCAACGCC CTTTTAAGGC 
GTGGGTCGAT GGTTCGAATC CATCATGGCT CAGAGGTGGG ATTGGTGCGC AACAAGGTGC 
GAGTTCTTGC GGTGGTCGCA GCGCTTGCGG CTGCGTGCGC GGTGGGCTTC TTTCTAGGAA 
GGTGGTTCGA CTTCTCTGCT AGGTCCTCGG TGCTCGAAGC AGCTGATTCC CTCTCCGTTT 
CTTCTTCGGA AGCGGCCAGC TTTTCCACGG TTGTTGCAGA GGGGGACCCG TACACCGTCG 
ACGAGCGGCA GAACATCGCC GTTTACCGCA GTGCCAACGA GGCCGTTGTC AACATTACCA 
CTGAGATGGT AGGGGTTAAT TGGTTCTTAG AGCCCGTGCC TCTCGAAGGT GGCTCTGGGT 
CTGGCGCTAT CATTGACGCC CGCGGGTACG TGCTCACCAA TACGCACGTC ATCGAGGGTG 
CGTCTAAAAT TTATCTCTCG CTACACGACG GCAGCCAGTA CAAGGCAACT GTCGTGGGTG 
TAGACAGGGA GAATGATCTT GCGGTGCTTA AGTTTGTTTC TCCTCCTGGA GCACGCTTGA 
CAGTTATCCG CTTCGGTTCT TCGCGCAACT TGGATGTCGG ACAAAAGGTG CTTGCCATCG 
GGAATCCCTT TGGACTAGCG CGTACTCTTG ACCGTCGG 
(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6234 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 
TTTGGAATTT TGTGTTGTCG TTCACGGTAA ATAATTTGTA GCGTTCCGTG CCCGTTTTGA 
AACGGTCCGG GGCTGCGTCC ACCAGGCAAG GAGTGATATG AAAGCGACGC TTACCTTTGT
CTTTATGCTC CTTACGTCGC TGCTGCAGGG TCAGTCGCAA CACATCACGC GCTTTGCCGT 
CATAGATGCG GCCCGCATTT ACTCAACCTT TTGGCGCGAT TCGCCGTTCC TGCGCrATtA 
TGAATCTAAA AAAGCACGGC ACCAGGGTGA AATTCAGAAA ATGTC TGATG AGCTCG TAG A 
nTCCGGGCAA AAAAAAGTTG ACGCGCAGAT GCAGCAAAAC ATCGCGTCAG TCCAAAAGTA 
CGAGGTGCTC ATTGCGTCAA AAACCGCGCT CCTGTTGGAG TATTCTAAAA CGTCCAACGA 
CGAGCTCACC GCGCTGCGCA AAACGCTCAT CGCAGATGAC GCATTCTATG CAAAACTCTA 
CGCCGC TATT AGGCGAATTG CAGAAAGTGA AGGCTACAGC ATC GTCTTAG ATCTGCAAAA 
AAACGCCGGA ATACTCTGGT ACAGCCACTC GGTCGATATT ACCGAAGACG TCCTGCGGGA 
GCTGAGCAGC TCGTGATGCA CCGTGAGCAC CGCGTCTcCT GCCTCCTACG TGTTGGcCCA 
GGAGCGTCCA CGTGAGGTCC CTCGCGTCAG ATACCCCTCT CATGCGTCAG TACCACGCCA 
TTCGGGCACA GCATCCGGAT GCGGTCCTGT TCTTTCGCTT GGGCGATTTC TACGAAATGT 
TCGATTCCGA CGCGCTCCAC GTGAGTACCC TCTTGGGGCT CACCCTTACA AAACGAAATG 
GAACACCCAT GTGCGGGGTG CCCGTCCATA CCGCGCGCAC GCACATAGCA CGCCTGCTTA 
AGCACGGTAA AAAAGTTGCC TTGTGCGAGC AGGTTTCTCA TCCTGTCCCC GGAGAACTCA 
CACAGCGCAA GGTAATTGAG ATTATCTCCC CCGGGACCGC AGTGGAAGAT GACTTTCTCA 
GTCAGGGATT TTCCCAATAC TTAGCCACCG TCTGTGCCTC AGACGCCACC GTCGCCTTTT 
CTTACCTAGA AGTCAGCACC GGCGCCTTCT TCATCACCAG CTTTCCCCGC GC CGAAGC AG 
CGGACGCATT GCAAAAAGAG TTCGGACGTG TCCAGCCGTC TGAGGTTCTC CTGTCTGCTT 
CAGTGCTCCG TTCACTGCCT GAACTTGCCG CTATCCTCAG TCTCTACCCC CGGCTCGTTC 
GTACCACCGG CGCAGATGCG CTTTTTAATC CCGAGCACAC TAAAAACCGC CTGCACCATT 
GCTTTCGCAC ACGCAACTTG GATTGCCTCA CCCTCCTGCC CCATTCGCCA GACCTCGCTG 
CCGCCGGGGC GCTGATTGCG TATTTGGAAG AAACCACGCG ACACCCGCTC TCCCACGTCA 
GTGCCATCAC CCGCTACCAT ATCCATGACT TTGTAGAAAT CG AnTGaCg c TACGCGCAAA 
AATCTAGAGA TACTTCAAAA TCTCCACGAC AGCACCCATG CGCATTCTCT TTTTGAAACA 
CTCAACTATA CACACACCGC CATGGGTACC AGGCTCCTGC GCTATTGGCT GCACCACCCC 
TTGCGCTCCC AGGAGGAAAT TCAAAAACGC CTCAGTGCAG TGGTCTTTTT TCATCACCGT 
CCCCACATCC TCAAGAacTG CGTGCAACAC TCTCGTGTGT TCGGGATGTG GAGCGCCTAG 
TCGcCCGCGT GGCGTTAGAA AAGGCGCACG GACGTGACTT GCTCGCCTTA AAAGAAAGTC 
TCAGGGCAAT CCTTACCTTC CGCAGCCTCG AGCGCGAAAG TCCCTTTCCC CCAGACCTTC 1860

TTCCCTCAGA AGGGGATACC CCGGTGCTGC AGGAACTGTA TGGTCTTTTA GAACAGTCTA 1920

TCAAAGAAGA TTGCCCCGTA ACGCTAAGCG ATGGGAACCT TATCAAGCGT GGTTTTTCTG 1980 

CGTCCTTAGA TGAACTGCAC CGCGTGCGTG ACAATGCAAA TGAAATTCTA AAACAATATT 2040

TGGCAGAGGA GCGTGAGCGC ACGGGTATCG GTACATTAAA AATGAAGTAC AATCGCATGC 2100 

TCGGTCACTT TCTGGAGGTA TCCAAAGGGC ATCTTTCTGC TGTCCCTGCG CACTTTATTC 2160

GTCGCCGTTC ACTGAGCAAT GCCGATCGCT TTACCACCGA ACAGTTGTCA GAATTGGAAG 2220 

CAAAACTTGC CCGCGCCCGT GAGgGCcTCG TTTCCTTTGA ACAAGAACTC TTTGCAGATA 2280 

TCCGCCGTAC CGTATGTTCT CATACCCAGC TGCTGCGCAC GAACGCTGCA CGGGTGGCAC 2340

AGCTGGATGT GCTCCAATCT TTTGCGCACG CTGCGyTCCA GCATGGCTGG AGTCAACCGG 2400 

TCTTTATCAA AGACGGTGCA CTTCGTATTA CGGGGGGCAG ACATCCGGTG GTGGAACTTC 2460

ATCTCCCCTC CGGGGAGTTT GTACCCAATG ATCTGACACT TTCTTCAAGT GAACATGCGG 2520 

TGTTGCCGCG cTTTGgsTCA TCACCGGACC GAATATGGCA GGAAAAAGTA CTTTTTTGCG 2580 

TCAGAcTGCG CTCATTTGCC TGATTGCGCA GGTTGGCTCC TTTGTCCCTG CAGAAAAGGC 2640

AGAGCTCACC CCCGTCGATC GTATTTTTTG TCGGGTAGGA GCGGCCGATA ACCTTGCGCG 2700 

CGGGGAaTCT ACCTTCTTGG TAGAAATGAG TGAAACAGCA CACATCCTGC GTGCAGCAAC 2760 

CCGCGACAGC CTTGTTATCA TGGACGAAGT AGGACGGGGA ACGGCAACTG AAGACGGTTT 2820

ATCCATAGCG CAGGCAGTCA GTGAATATTT GTTGCATCAT GTGCGTGCAA AAACGCTGTT 2880

TGCAACACAT TACCATGAAC TGTCCCGTCT TGCCCACCCG CAGTTAGAAC ACCTCAAGCT 2940

TGATGTTCTA GAAACTGACA ATACCATTGT ATTTCTGAAA AAAGTGACGC CCGGTTCTTG 3000 

CGGCAGTTCG TACGGCATTT ACGTTGCGCG TCTGGCGGGG CTCCCTGAAT CGGTACTGGC 3060 

ACGCGCGTGT GAGCTTTTGA AACAACTGCA GCAGCGGGCA GGATCTGCTC CACGTGCGTn 3120

CTnTGCGCAC GAAGCAGATG CAGTGGCTCA AACAGAAGCA GTACACGCGC ACAAGGCAGC 3180

GTCTAAACCG TGCGCGCagc GTGTGTCGGC AGATCTATTT ACTCAAGAAG AGTTAATAGG 3240 

CGCAGAGATT GCaTCGTTGA ATCCaGACGC CATTACACCG CTTGAAGCGC TGACACTCAT 3300 

CGCGCGGTGG AAACGCAGCC TCCGCGGTTC TGCAACGCAG CAGAGCAGCG CCATGACAAA 3360 

ACGGAAGGGG TAATGGTATG TTCCCCTGTT ACGCACGACG GGTATCGGGC ATGCGGCGCG 3420 

CGGCGTTTTG TCCATTCTTT GCGCTAGAAA CAGAGCGAAC AATATTCTGC CTACCTGAGG 3480 

AGAGAAAAAC GTGAATAAtT gCACTCCGTG cGTaCCTGAG TACGCGTGCT CCTGACCAGA 3540 
TACATAGTGC TTTTGTTGCG TATTTGGCCA ATCTTGATTT AGTTGCGCAC CAGTTTCCGC
AGATTGCTTC TGATATTGTG CAGGAGCTGA TAGATCAGCG GTCGTATGTA AAGTTAATCG 
CAAGTGAGAA TTACAGCTCT CTTGCGGTGC AAGCGGCGAT GGCTAACTTG TTGACTGATA 
AATACGCAGA AGGGTTCCCC CATCATCGCT ACTATGGCGG GTGTCAGAAT GTTGATTCTA 
TTGAGTCTGC CGCCGCTGCA GAAGCATGCG CGCTCTTTGG TGCTGAGCAC GCATATGTCC 
AGCCGCACTC CGGTGCAGAT GCGAATCTTG TTGCATTCTG GGCTATCCTT TCGCGGCAAA 
TTGAAATGCC AACCCTTTCT TCTCTTGGTG TCACCGCCgC TACGCATCTG AGTGAGGAAC 
AGTGGGAAGT ACTGCGCCAG AAAATGGGTA ATCAAAAACT TATGGGGTTA GATTATTTTT 
CAGGCGGTCA CCTGACCCAC GGGTACCGCC AAAATGTTTC AGGACGAATG TTTCGTGTGG 
TGTCCTACGC GGTGGACCGA GACACAGGAC TGCTCGATTA CGCTGCAATC GAGGCACAGG 
CAAAGCGGGA AAGACCACTT ATTTTACTTG CCGGATACAG CGCGTATCCT CGTTCCATTA
ATTTCCGCAT CTTTCGGGAA ATTGCAGACA AAGTGGGCGC AGTACTCATG GCTGATATGG 
CTCACTTTGC TGGACTGGTT GCAGGCGGTG TTTTTACGGG AGACGAGGAT CCAGTGCGCT
GGTCTCATAT CGTGACCAGT ACCACACACA AAACGTTGCG CGGGCCACGC GGTGCCTTTA 
TTTTGTGTAA AAAAGAATTT GCAGAGGCGG TGGATAAGGG CTGTCCGCTT GTGCTCGGCG 
GCCCGCTGCC ACATGTGATG GCAGCAAAGG CGGTTGCGTT TCGTGAAGCT CGAAATGCTG 
CTTTTAAAAC CTATGCGCAC GCAgTCCGTG ATAATGCGCG TGCGCTGGCA GATGCCTGCA 
TACAACAGGG GATGCAGCTG CAGACAGGGG GGACGGATAA CCATCTGCTA TTGCTtGACG 
TGCGTCCGTT TGGACTGACA GgTCGTCAGG CAGAgCGCGC GCTGATAGAC TGCGGAGTGA 
CGCTCAACCG TAACTCGCTC CCCTTTGACC CAAACGGCGC ATGGCTCACC AGCGGACTGC 
GCATCGGAAC CCCCGCGGTA ACGAGCCTTG GAATGGGGCC TGAGGAAATG AAAAGAATAG 
CGCGCCTGAT CGCGCGCGTG CTCGGCGCTG CAACGCCTGT GCGGACAAAG ACAGGTGCGC 
TAAGCAAATC GGCGGCCGAG GTGCCCGGCG AGGTTAGAAG CTCAGTCTGC TCGGAAGTGC 
GGGAGCTGCT CGCACGCTTC ACGTTGTACC CTGAACTCGA CGAACCCTTC TTGCGCGCAC 
ACTTTACGCG TCGCCCTGCn GGACAAAACA CCTGCCGACG AAGGgACTTG AACCCTTaCG 
GGGTTACCCC AACAGATTTT GAGTCTGTCG TGTCTGCCAG TTTCACCACG TCGGCCCGCG 
CGCAgCCTAT CACACGAGGA ACAAAAGgTA CAGCTGTTCA TGTAGTCTTC TTGCGTGAGG 
CCCCGTGTCT CCCATTGAGG GAGCCGTTAT TTTTCTCCCA TGAGGAGTTT TAGTTCCCGA 
ATATCTGCCA CCAGTTTAGA GCGATCTAAA TGCTGATAAC GCGCAGGGAG CATTTCCTTT 
CCTGTGCATT CGAGTACTGC CACTAAATTT TGCAGTTCAA TTTCGTGCGG ATAAGCAGGA

GGGATAAAGT CTTCAATGGT GCGGGTGATG TCCTGTGTGG TTACCATCGT GCGATTTTCC 

ATCGCAGCAG TTAGCTGGGC GCGTACTAAA ATGGCTTCTA AGTCCGAACC GGAAACAGCG 

AATTTTATTC TGCGAATGAT TGCCGGTACG TGTACATCTT TGAGCTTGAT ACGTAATTTT 

TTTTTGAGTG CTTCAAAAAT TTCCGTTTTT TCTTTTGTGG TTTCAGGGTA GAAGAGCGCA 

AGATGCTCTT CTGCGCGTCC CTGTCGTTTC AGATCTATTG GTAGCAAGTC TGGGCGCGAA 

GTAATCAGGA ACCAAATAAT ATTGCCCCGG TGTTGGGTGT TACCCATAAA CCCTGCAATT 

TGTGCAAAAA TACGCGATTC ACCTGCCGGC GCGTTACGCC TACCAAACAC CGCATCAGCT 

TCGTCCACCA TCACCGCTAC CGGGGTAAGC GCTTTGAGGA TGTTGAGCGT TTTTTCTAGG 

TTCGACTGTG TAATGCCAGG CTGCGTTGCC TGGAAATTAC ACAAACGCAC CATGGGAATC 

CCAATTTCCC CCGCAAATGC GGAAACCATA AATGATTTGC CTGTCCCAAT CGGCCCTGAG 

ATAAGGTATC CCATTGGCAA CACATCTGCT CTTCCTTGCT TAATGGCGCG CACTGcGTTA 

TACAATCTCT TTTTTACAAA GACATTTCCT GcAACGTATG AAAGGTCGCA GGATGTGTCG 

ACAAATTCCA ACAAACCGCC TGCTTCGTGC TCAATAATTT CCTGTTTCTT CCTTTTAGAA 

ATGTAAGGTT GCAGAGTCCG TATGGGAAAG TCTACTGATA wTGwcCTCtG GCTcCCATTG 

CATCGATCGT TGGnACGTCT CTGCCGCAAG CTGGTGGAGG GTCACTAAAT TCAA 

(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1548 base pairs 
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24:
CGGATAAAGG ACGACTGGGC TGGCAGCGGG TGTGGGTTCC CACCTCCTGT TCGTGTCTTT 
TCAGGGTGTG TGCGCGTTCC GAGAAGAGGG CGTTTTGTGT GTGGGGAGGA GTACGATGGA 
TACGCATATA TGAGGCGCCG GGTGTGCACG GTGGTGCGCG CGGTGGTGTG TCTACTCAGC 
ACGAGTTTGC TGACCACGTG CGATTTCACT GGCATCTTTG CGGCAATTCA GTCGGAAGTG 
CCCATTAAAA CGCCGTCCAT CCCGGGGGCG ATTTATGGCC TGGTCAAGGC CGGGAGCAAG 
CTCTACGCCA CCAACGGCCG GCTTTGGGAA AAGGAGCTGA ACGGCACTGG GTCgTGGCAG 
AAAnTGTCTT CCTCGTCCGT TCCCACTGAC TCGGATAAAA AgGTTATGAr CaTTGCCACC 
GACGGGaACA CGTTCGTCCT CGCCTGCGTG CCTGGCACGG GCGTTTACAA ACACTGCGTA 480

AATGGCGCGG GCAGCTCAAG CACCGGCACA ACGGCAAGCC CCTCGACTGA AACCTGCTCG 540 

CAGCATGCGA CGCTCGTGGG GGGAACGTCC AAGCCCTTCT GGCTCGTGCC GGGAGGCgnG 600 

nGAAATAATG GGAACTGCGG TTGCGGGGGA GGGGGGGGTG GCTCCTCCTC GAGTAGCAGC 660 

TCGTGCATTC ACATCTGGCT CGTGCCGGGA GGCnGnGnGa AATAATGGGA ACTGCGGTTG 720 

CgGGGGAGGG GGGGGTGGCT CCTCCTCGAG TAGCAGCTCG TGCATTCACA TTAAGGTAGA 780 

AAACACGGAC GAACAGTTTC TCGATATGGG TGAGGGGTAC GTGGTGACCA CCAAGCACCT 840 

CTACACCAAA AACGGCTCGT CCAGCGCGGG ACCGGCGCAG TGTCCCGGTG GCGGTGGCGG 900 

CGGAGGCAGC AGCGGGGGTG GGGGTTCCTC GGAGTACACC AAAGCTTCCT GTTCCTTTTC 960 

CACGCCCATT CTGGCAAGCG TCACAACGGG TGCTATCACT ACATTCTCAC CAAAGAAAAA 1020 

GTGTACTGCA GAAAGCAGGA CACCGCTTCC TCCGCTGCGT CGTCACCAGC CCAGTGTCCC 1080 

TCTTCCCCTT CTTCTTCTTC CTCCTCCTCG ACGAATGCGG GATGCGAGGT GGCGCACGGG 1140 

GTGGACGACC CGCTGTGTCT TGCGATTTTT AAACACAACG GCTGCGAATA CTTGCTCATC 1200 

GGCGGCAGTC GGGGCTACGG GGAAATAAAG CTGGAAGCGA ACTCCAGCGG TACGAACGGC 1260 

ACCTGCATGC GATTGAAAGA GAGCAATGTG CACAAGAGTC CGGGCCAGTG GGGCGAGTCG 1320

AGCCCCACGC CCAAAGCGAG CGCCGAGCAG TATCGGGGCA CGGTCGGTCG GTTTGCCGTG 1380

CAGAAAATCT ACGTAgTTGA AAAAAATGGC GGTGGGAACG GTGTCGCCGC GGGTGGGGCG 1440 

GGCTGTCCTG CAAACGCCAG CAGTTCCAGC GGAGGGACCA GCAGCACGCA GCGTCCAGAC 1500 

CTCTACGCCG CAGTGGGGGA GTCGAGCGAC ACCTAnCACG GGGGGTTT 1548 
(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3172 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 

TACGAAGAAT CGTACTGCCC ATCCCCATCC CgATCAAAAT TCCCGTCAAC ATTGTTGATA 60 

AAGTACTGTT CCTTAAAAAA ATCGAGTCCT GTCTTTCCAT TCAGCCCCAT TGCGTTACGG 120 

TGTACGTTAT TTACCAGATC TACAAAGTTC ATAGCCATAG TATCGAGCTT GCGCAsTTCA 180 

TCGCGCACGT CTGTGTCACG CAtTCTATCA ACGCTGCAAG CTTACCCCCA GAAAAGTGCG 240 
CACGATCCCC TGAGTCCTTC CACACCACGG ACACATATCC TTCAGGCGTT GTTCCACTGA
CAAGCCCAAG TGTCTTATAA CTCTTCCCCT GcACAACTTC TAATCCCGCA CTGTGAATAA 
CGTAGGACTC ATCCTCATCA CGCACGTCCA CCCTGACCTC AATGCGGTGC GCGAGACTTT 
CCACCAACGT ATCACGTCGA TCTAAAAGAT CATTAGGATT ATCACCCATC GCTTTTGACT 
TCACAATCTG TTCATTTAAT TGAGCAATTT TGGCAAGGAG ATCATTCACC TGCTCAACCG 
TTGCCTCAAT ATCCGCGTTG AGCATATCGC GAATACCGAC AAGACCTCTA TACTGATGAT 
GAATGGCATC CGTTAGCGTC TGTGCACGCG TGAGCACAAC CTGACGCGCT GCACGGGCTT 
CAGGATACAC AGACAGTTCC TGCCAGCCAT CCCAAAACTG GTCCAGCCTG GTACGAACTG 
CAATATCCTC CGGCTCATTA TACACCTGCT CTAAAAGACG CACATACGCA TCACGCGTGC 
TCCAATAroCC CTGTTCGTCT GTCTGAGACA CAATGCGACT ATCGAGGAGC TGGTCAcGCA 
AACGCGCGAT AGAACCGATG GTGAcCCCTT GTCCTATCTG ACCAGGCAGC TGAGCGCGAG 
AAAGATCAGG ACGGTACAGC GGCTCGAACG AATCGAGGTT TACTCGCTGG CGGCTATACC 
CCGGCGTGGA AGAaTTCGAC ACGTTGTGTC CTGCAGTCTG TACAGATTGC TTATGCGCGT 
AAAGAGCACG CTTTCCAAGT TCTATAGATG CAAATGTCGA CATGTGTTCT CCCTATAAGA 
ATGGAGGGTA CGCGCAgCAG CCCCATCAGG AACCTCCCTC TCGCCCTGTG GATACTCGCG 
CTCTGCTCCT CCCCCGAGGT ACCGCACCCT ACAGCACACG ATCAAAGACA AGACTTCCAG 
GCGCACAACG GACTGGACAT CCGTCCTTCG TATAGGAGCT GCCCTGCTGT TCACACGTAA 
GGGCGCTGAC AAGCGcGTGT GCCAGACCAC GCGCGTGAGT CAGATAGTGT TGGATTGCAT 
CGTGCTCATT TTTTGAAGAG GCAACTTTgc tACGCAGCGT CCTATAGAGA GCGACGACCG 
CATCGTGCAC ATTGACATCC GCGCGCCTCA AATAGGCAAA GAAGGAATCA AAGTCAACCG 
GCGCGTCACC GTACGGCCGA ACCTCTTGGA GAAgTAAGAA ACACCGTTTA TCGAGATGCA 
AGAACTCACG ACTCaGCGCC TGTGCACGGC TGAcAAAGGA TTCTACATGC TCCCaCGCAC 
GTGTGCGcAG CGACTCGTAC ACACTACGCT GGACCTGTAT CACCTGGCCA ACAAGCTCAA
TCTGCGCAAC AAGAATTGCC TCCACCTGCC CTGCCCGGTG CAAAGCCCGC TCTCGGTCCA 
TCGCCCACTC CTCTAGCCAG GAGTCTCGGC ACCTTTCCCG AGCGCTTTAC TTTTTAGTGA 
AATAAAAAAC CGTCCAGTGG TCTGCaGcTC CGCAGGCTAC TGGACGGCTC GCACCTGCTG 
GCTCTAtGCG CGGCCGGCGC TTCCGAGCAC TTCCTTGTTT TTGTGCATGT ACAACTTCTT 
AAGCTCATCG CGCGCAGGAC CCAAATACTT TCTCGGATCA AACTCATCTA CCTTGGTGGT 
CAGCACCTGC GTATAGCTGC AGTCATAGCG AGGCGACCGT CCGAGTCAAT GTTCACCTTG 
CACACCGCGC TTTTGGCAGC TTTCCGCAAC TGCTCTTCTG GAtACCCACA GAATCCGGCA 2040

GATTTCCACC GTACCTTTCA ACCTCCCGTA CGTACTCAAC GGGCACAGAC GAAGCACCGT 2100 

GCAGCACGAT GGGAAAGCCA GGAATACGCT TTTCTATCTC TGCGAGGATG TCAAAACGTA 2160 

GCGGAGGGGG GATCAACACT CCATCAGCAT TGCGCGTACA CTGCTCTGGC GTAAACTTTG 2220 

CTCTCCCGTG ACTTGTTCCG ATGGAGATGG CaAGGGAATC CACCCCCGTT TTTTTCACAA 2280 

AGTCCTCAAT TCGTCAGGCA TAGTGTAGTG GCTCTTCTCT GCCACTACAT CGTCTTCCAC 2340 
ACCAGcGAGT ACCCCAAGCT CCCCTTCCAC GGTGACATAG TCCGCACGCG CATGGGCATA 2400

CTCGCACACC TTCCTGCTTA GCGCTACATT CTCGTCGTAC GGCAACGCCG AACCGTCAAT 2460 

CATCACAGAC GAAAAGCCAC TCTCTATGCA GTCAATGCAC AGCTCTAGGC TGTCACCATG 2520 

GTCCAGATGC AAAACAATGG GAATATCAAC GCCGAGCTCA TGGGCATACT CAACTGCGcC 2580 

GCGTGCCATA TTGCGCAGGA GCGTCGCATT TGCGTACTTG CGCGCACCGG AAGAAACCTG 2640 

CAGAATGACG GGAGAACGCG TTTCAACACA CGCCTGTATG ATTGCCTGGA GCTGTTCCAG 2700 

GTTGTTAAAA TTATACGCAG GGATCGCGTA TCCGCCCTTT ACTGCCtTTG CGAACAGGTC 2760

CTTGGTATTC ACCAAACCGA GTGCCTTGTA ACTAGTCATG AGAACCCCCT TTGTTAGGAT 2820 

TGCTTCGAGA AGAGTCACGA AATAGAGAAG CGTGCCACCC TCGGCAAGAG GGGCATGGTA 2880 

GGGCGATCGG GACGCTCTAG TCAACCGAAG CGCGAAGGCT TGAGTCCACA CGTCAGGCGT 2940 

TGGAACGGCA GCAAGACGAT TTGGACAGGT ACCACGCGGG AGGTTTGACA AGCTATTTCT 3000

CCATGCGCTA GAATGCGGCG AGCTGGCGCC TGCGAGGCGT TAGGGGTGGT GAAAAGGAGT 3060 

TTGCGAATGA AACAGGGCTG TTTTATGGTG GCGGGCTTTG CGCTGACGTG CGCGTTTTTG 3120

GTGTCCCCCC TTGCGGCGCA AAGGTCGAAG GTCAATTACC AGGCATACTT CA 3172
(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24699 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 

CGTTTTTTTA ATGGGGTAGC CAAGCCAGCA TCCACCATTT TCTGCATAAT GGTGTCGTAC 60 

TCTTTTTTCG AACCGTCGCA GACGTAGACG GTATCTGGGG CACAGAGTGC GACCATCTCT 120 

TCTATCCACG CCTTTGCTCG AGCGTGGGCA ATCTCGTGAA GTTCCATAAC GCCGCTCCTT 180 
GGTGCGTAGC GTGCTGCACG GGTATTCCAG GCACGTATCG
AaAGGTAAAT AAAAAAGACC TCTTCAAACC GAGTGTCTCT 
GAGCGGTGTT CTTTACCTAG GnCGTTTCAC TCTCTGTCTA 
TCAAAAAGTG ATGACGTGTG CCGATACCGT CAGGGGTGCG 
TATCTACGTT GAGCTTCCCT ATTACTATCA AcTGACGCGC 
ATCGCTATGT GCGCGTATGA GAAGGTTCGC TGTCCACAAC 
ATCGTCCGTT CGTATCTTTG CATTTGAAGC ACACAGTCTC 
ACGCTGCGTG CGTGCGCTGT ATCAAACACT GGACACATAC 
TCGTATCCTC ATGGACGTTG TTGCTGACGA TGCTTCTCCC 
CCATGCATAC CGCAGTACGC TGATTCCTGA CCGTGGTTTT 
ACAGCTTCTC AAGCATTACC TTGAATTTTT GCCACTGCCA 
TAATGGTTTC CTTTCACTTT GTGCGGAAAA ACCTTTTCCA 
CATAGTTGTG CGTACCACTT CTTCATACAT GAGTGCTCTG 
TCCGTTGTCC GAAGCGGTCT ACTCAACGCT ATCTGAGGAA 
TCTGCGCGCT GCGGTGTCTT TTTTTAAAAG ACGGCGGTAT 
TTTAACCGAT GCATTTCTTC AGTATGTGGG TCTGTACTTT 
GCCAAATGcG GCGCCGCCGC CCATTTATGT AGACCCTTGT 
GCAGGCAGAG AAAGTACTGA TCGTCAGTCC ACATTCTCCC 
CTGCGCAGAT ATTGAAGCTA TTCCGCAAGA TCTAGCAGAA 
TGCCTCCCGT TATATTTTCG CGGACGAAAT AGAGGAATTT 
TGCtGACTTC GTCGGTGATT TATTTGACAA AATGTTTTGT 
GCACAACGCG TATGCCATTC CAGAGGATGT ACACGACAGT 
GAAAATGCCT GTAATACGCG AATGTATTTm wwcCCTTCCT 
GATCGCTTTG TGCTAGTACG GATCTGCTCA GAACTTTCCA 
CATCCGATTG TGTGTTACAC AGCTTGTTTC ATACGTATTC 
TACAGGTAGA AGAGTACACC GGCACGGATG TCGGCGCAGT 
CGCTGCTGGT GGGCATGCGT GAAGACGCAG AGGCCGCGTT 
TGACAACACT GCAGGCGCGG CGTTTTGTGT CCGCTGAATA 
GATTTCTAAC CATAGGTCAG AGCAAATTTG AAGACGCGTT 
CCCCAGAAGT AkAGCGCGsw 240

GTCACCGCAG GaGCCGACAT 300 

TTACCTGTCA GTTGTTTTTC 360

CAAGAGGTTT TTATGCTATG 420 

ATCTTCCCTG CTGACATCGA 480 

GGTGCTGCCC TCCACGAGGC 540 

GGTTCTGTAT ACGcCGCGGT 600 

GAAAAGCAGG TGAAGGAATT 660 

TGTCTGATAG AAGATCGCTT 720 

TTTGCATCCT TTCGTGCAAA 780 

GCGCTGAATA TGTACCAGGT 840 

CAAGGGGTAA CCACGCACTG 900 

TGTAATTTCA TGGCGCTCCA 960 

ACGCGTGCGT TTTTTTTTCA 1020 

GATTCGTCTT TTCCCCAATA 1080 

AAGCTTTATT ACGAAgCGgC 1140 

GCTGGACATG AGAGCCAAAA 1200 

CTTATGCGGT TGCCTGCATC 1260 

GTCATGTATa CGCTTTCGCT 1320 

TTTCTGTTTT TGAAAAAACA 1380 

ACCCAGGTGA CGATGGTGCC 1440 

CTAGAAAAGC GTGTGCGCGT 1500 

TTGGAAGAAA TATCAGGAAG 1560 

GGAGCTTCAG TACAAATACA 1620 

TGACGTGCAG ATTGCGCACC 1680 

GTTAAAGGTA TACCAACACA 1740 

CAGAGAAGCA AAGGCTTGTC 1800 

CCGGACCTTT TCCCTCTTAG 1860 

GGTGTATTTT GGCTATGCAC 1920 
TCGATGATGC AGAACAGctG CGCGACGGTG ATTTTCTCTG TTCCGCGCTT TTTCATTTGA
GTATTACCTA CTTTTTGCAG CATAACTTTA CCCAGGCGCG GCTTTTTCTG AGTAAGCTAT 
CCGATGCGAT ATCCACGTAT TTTGAGCAGC GATGGAAAAC TGTCAGTCTG TTTATGCAGG 
GCAGAATTTC TCTCAGCCTC GGGGAGTATG CACAGGCGCG TCGGTGTTTT GATGAGGCTG 
CCGATTTTGC ACTGCAGTAC TTTGAACACC AAGAACCCTT GTGCAGAGTG TGGGCTGCAC
ATGCACGGCT ACTTGCGGAT AAGTCGTATG CAGCGCACGC GCTGTTTCAG GACATGTGTG 
ATCAATACCC TGATGCATAT CTCTTTCTTG TAGAAAGCTA TGTCCGCGCA GAATGTTTTG 
ACGATCCCAC GTTGTTTCAA TCGTTTCCTG AGGAAACGAC CTCTCGCGAG CCATGTGTGC 
CGTCCTTCTC TCTTGATACG CCGATTTACT CAGGGTTCTC CTGCGCAGAA GATCTGGTAT 
GGGGCAGGCA GTGTGCGTTT GCAGTGAGTG CGCAgCACAG tACGGTATTT GCTCATTACT 
ACCATTGCAG GGTGCATCTG CACCGTGCCG AGGATATGCA AACATTCCAC CACCATAAGC 
AAAAACTTGA GGCCATTGCA CGTCGCGCGT TTCAAATAGG TGATCCGAGT GCTGCGTTGT 
TTCTGTACCT CTGCTATGAT GTGTCCTACC GCGTGCACGG CGCAGAGGCT GCTGTCACGA 
CAGCGCACCT GAGTAGGGCG TTTAAAGTGA TGCAGCGCAg CGTTGCGTAT ATGTCAGAAA 
ATACCGTTCG CGCACAGTTC ATGCAGGATA ACTTTTGGAA TGCAAAACTG TTTGCCGCCG 
CGCAGGCAAA CAAACTCATT TAAAGCAGGG GGCACTATGG CGGTCAATTG TGGCATTATC 
GGTCTGCCGA ATGTGGGGAA GTCGACAATT TTCTCCGCGC TCACTGCAAA CGTCGTGGAG 
GCGGCGAATT ATCCCTTTTG TACTATCGAA CCTAACGTGG GTATGGTGAC AGTACCTGAT 
GTGCGTCTTG AAGCACTGGC TGGTCATTTT CGGCCAAAGA AAACGGTGTA TGCCTCCATT 
GAATGTGTGG ATATTGCTGG TTTGGTAAAA GGTGCCTCGC AGGGGGAGGG ATTGGGCAAT 
CGTTTTCTTG CGCATGTGCG AGAGGTTGGA GTACTTGCAC ATGTGGTGCG CTGTTTTGAG
CATACGGATA TCGTTCATGT ACATAATAAG GTCGATCCTC TTTCAGATAT TGAAACGGTG 
CaTATAGAGC TGGCATTGGC AGACCTGGCC TCGGTAGAAA AACGGGCTGT GCGTGCTCAA 
AAGGAGTCGC GTATGGGAAA GTCCcTTCAA AAGGAAAGCA CGCTGGTATT ACGGGCACTC 
GAATACTGCG CGAATATTTA GAAATGGGAA AGGCGGCATG TATGGCGCCG CTGTCGGATG 
AGGAgCGCAA ccGGTGCGCG ATATGCGCTT GTTGACAATG AAGCCGCACC TGTACGTGTG 
CAATACAGAC GAAAGCGGCA TGCAGTACGG AAATGATTTC GTGCGCGCGG TGCAAGAGCA 
CGCACGTGTG CATAACACGC AGGCAATTGT TATGTGTGGA AAATTTGAAG CAGAGctTGC 
GCAGCTTTCT GATGTGGCAG AGCAAAACGC CTTTTTGCAA GAATTAGGGT TGCGCGAATC 
aGGACGTgCG GcTTGCGCGC GCAGTGTATT CCCTGATGGG GTTGCGTACC TTTTTTACCG 3720

CGGGGCCTGA GGAGTGTCGC GCGTGGACCA TTCGGGCAGG GCTGCGTGCA CCGCACGGGC 3780 

AGGAGTGATC CACAGCGACC TTGAGCGTGG TTTTATTCGT GCAGAAACGT ATTCTTTCGA 3840 

TGAtCTTkCs TCCtGTGGGA GTGTGGCAAA GtGAGGGAGG CAAACCGCGT TCGGCAGGAG 3900 

GGGAAGGAAT ACGAGGTGCA AGACGGGGAC GTTATCtTTT TTAAATTCAA TGTGTGAAAC 3960 

ACAGGCGCTC CgTTCCGTCT GTGCGCCgTG TGCGATACAt GAGCCTTGAT TCTGCGTTTG 4020 

AAAGCAGGCA CAATGCtCCC GTGCAGCGTA TCATATCTTG GAATGTGAAT GGAATTCGTG 4080 

CCATAGAGCG GAAAGATTTT CTCAGCTGGC TCGCGCGTGA GGCGCCTGAT GTTCTCTGTT 4140 

TGCAGGAGAT TAAAGCGCAT GAGTCGCAGC TGAgTGTGCG CTTCGTGCTC CGGTCTGGAG 4200 

TGCTGGGGCG GGGGGTACGT ACTATACCTA TTTTCACAGT GCGCAGCGTC CTGGATACAG 4260

TGGCACGGCG CTGTTCAGTA AGCGCGCGCC AGATGCGGTG CGTTTCTTCG GGGTTCCGGC 4320 

TTTTGACTGC GAGGGGCGGA TGCTTGCGGC ACgCTTTGGC GAGCTGACGG TGGTAAGCGC 4380 

GTATTTTCCG AATGCGCAGG AAGGGGGCAA GCGGCTCGCG TATAAGCTTG ATTTTTGCGC 4440 

AcGTTTCGTG CGTTCTGTGA TGAAGAGCGT ACGGCCGGGC AGCACGTGAT CTTGTGTGGT 4500 

GACTACAACA TAGCGCATAA GGAAATCGAC CTGGCACATC CTCAGGAAAA TGAGGGGAAT 4560

CCTGGATTCC TGCCTCAGGA GCGTGCATGG ATGGATACAT TTACGGAGGC AGGCTATGCG 4620 

GATAGCTTCC GAGCCTTCTG CACAGAAGGG CAGCAGTACA CGTGGTGGAG CTACCGTGCC 4680 

CGTGCAcGCG CGCGTAACAT TGGATGGCGC ATCGATTACC AGTGTGTGGA CCAAGCCTTT 4740 

TTAGCGCGCG TGACCTCTTC GCAGATACTG TCCGAGGTGA CAGGATCGGA TCACTGCCCA 4800 

GTGTGTTTGA CGTACGCGGA CTAATCCGTT TCCGGGGTGA GCGGCACGTC CGCGCAAACT 4860 

AAGACGTACC CGCGCGCACA GGCAGCGTCA GAGGTGGTAG CGAACGTCCA CACGCGCGGC 4920 

TATGAACTGT GCGGTGCGCG TGTTGGTCTG CTGTCTATCT TCTTCAATAA TCTTTTCGCA 4980 

TGACCGGGGT ACGCCGCTGT ACGTGGCGCT TACCCCCAAG GACCAGTGCT CTGTCAGTTG 5040

AAAATAGCAC CCCGctGCCG CCTTGAGCAC AAGACCGTAG TAGGTAGACG TGTAGTAATG 5100 

CTGATAATTG AAGCCAGCCC CTACCGTCAG TGGCAAGCGG ATGCGCCAGA AGGCAACCGT 5160 

GTACCCGGCA GTGAGGGCAA CGGGAATTGC AAGGTAATAG TACGGAGTAG TGGGACTGTA 5220 

CGTATTGTTT GGATAGCTGC AATGGTACTG CACACTTGCG TCAATCCCGA GCGACAGGCC 5280

GCGGCACACA AAGTGTTCAA ACCCTAACGC CGCACTGAAC GCGGGGTAGA TGTACTTGTG 5340 

CCCGTTGGTT TGCGCGTTGG CATTACGGTC GTCCCCGCGA CCGCTGTTAC ACCAATCCAC 5400 
TTGAAAGAGG GGCACCGCGC CCATGGCCGA AAGGGTATAG TACTCCGGCC CGCCGCAGTA
GTGTCCCACG GGTCCGCGTG CACCGGGTGT GCAGCTCCCC ACACTCCCGC GCATATTCCC 
AGCACCGGGC CGACCGCCCA CCACTTCAAT TGTTTCATAC CCCGCTCCAA CGCCGATCCT
CTTACGCGTC TCGTCGAGGA CCTACTCCAT TCTACCCCCC CCCCACGGCT GTTTGTCGAA 
CCCTTTTTAA AGGGTTCGTT CTCGCGCGCT GGGCAGCACG CGCGTGAGGC GCCTATGCCA 
TCGGGAGCTG CGTTTTTCTT ATGCCCCACG AGGGGACTGC GGGGTATGTC GTGCGTCCGC 
ATGGGTGTGG TATCGGTGAG AAAGACACCC TGAAATACAT TGCTCTAcTT CGTACCAGGA
ACTGCAGCAG CAGGGGGAAC AGGGACACCC TGGGTGAAAA GACTGCACCA TGCTAGGATG 
GGGAATGGAT ATGTCCAAAA GTGTGATGCT GTGTTGCCTG TTGAGTGTAC AACCCTGTTA 
TGCCGGGTAC GTGTTTGTTT CCCCAAAGCT TGGCGTGTAT GGAGAAGCAT TGGGCGGTCC 
TGACACGGTG GGTAAAGCGG TCAAGCAGGC CGACGGTACT AAGATTGCTC CGAAGATATG 
GTACTACGCG CCGCTACCCC GCTTTTTGGC GTGGATATAG GCTATCAGGC GGATAACGGC 
CTGTTGTTCC GGGTGAATTT GGATGCGGCA CTCACGCGCC TTATGTTTCG CAGCCAGTGT 
GTGGTGGGCT ATTCCTTGCG GTTCGGCTGG GGGGGGGGGT ACGTCTCTAT CGCTTCGGGA 
ATCGAGTGTA GTGCAACGGT CGATGACGCG CAGTACGAGC CCTACACGAA AAATGAGCAG 
GGGACTACTG TTGCCTCCAA CACCGTGTTC CCGTGCACGG TCTTGGAGGC ATTGGTGCGT 
GATCCGGCCC TTACCGCAGA TTACCTGCTT TACGGTATGC AAAGCTGTTA CGCAATTCCG 
CTCCATGTGG GGGTTTCGTA TTACCTTGCC AAGCGCTGGG GTATTGAGTG TGCGCTTACG 
GCCTCACTTG GCATTTCAAT GCGGACGGAT GTGCGCGTCC CCTACGCGGT ACGCATAGGG 
CCGGTATTCC GCGTGTAGGG CCTCCGGTGA GCCGCTCTCC TTCCCATAAG ATGGCGTTGT 
TGGCTGGGGC TGGGGCTGGG GCTGGGGCTT TCCAATGGAC GGGCATGTAC GTACGGTCCT 
ATGGAACTTC GTTGTGGGCT GCGGCTCCGG GTAGGGCTGG GACTCCGGCT GCGGCTCCGG 
CTGCTTGGGC ATAGCCGCTA GACAGTGTGG AGTTCCTCCG GGCGACTGCG AGCCGAGAAG 
TAGATGTCAA CTCGGGCTGG GGGTACGGCT CGGGCGTGGA ACGAGGTTTT TTGTGTGAGG 
GGGATGTGCG CGCGTGTCTT GTTCCTGCCT CCCACTTCGT AAGAGCAGGA ACCGCACCAG 
GGACGACGCC GGGGACCGCA GCAGCTTGCT TGTTCGTACC GCCGTACgTG CTGGGGGTTA
CTCCTGCTGG GACgTTGCCG TTTGGTTTGC CCTCCCAGTC GGTGTTAGGG GGgcTGCGCC 
TAGCTCCAGT GCACGCGTCC TCCCTGCTTG AGGGGTTGGG CGCCACCATT TTTTTCAGAT 
GCAATGCCCC GTGAGgTTTG CGCAgTCGTG CTCTCGCATG GCTATGTTTC TAGCGAGAAA 
TACTCCCTCG AATACATTGC TGGCCTTCGA TGCGGGCGGA CGTCCCACAC CGGGAAGTGG 7200

ATGTCGGCTG GGGCTCGGGC GTGGGCTTTT AGGACTTTGT AATGGACAGG CATGTACGCC 7260

TGCGTCCTAT AAGATGTCGT CGGCTGCGGC TGGCACTGCG GCTCCGGCTC GGGCTCGGGC 7320 

GTGGAACGAG GTTTTTTGTG TGAAGGGGAT GTGCGCGCGT GTCCTGTTCT TGTGCTCCAt 7380 

TCGTACCAGG AGCAGGGGcT GCAGCAGCAg CTTTTTTGTT CGTACCGCCG TACGTGCTGT 7440 

GGTTTGCTCC TGCTTGGACG ACGCCCTTGC TGTCTTTGCC CTCCCAGTGC GCACCTTCCC 7500 

TGCTTGATGG CGCCACCATC GGATGCAATG CCCCCGACAC GTCTGAGAAG CTAAACGAGC 7560 

CTCCACATTC ACGCAgTCCG CGCcTACGGT GCCCTGCGCT CCGTCTTTCT TACGCCCCAC 7620

GAGGCTGGCG CAGTCGTGTG CGGTCATGGC CATGTTGAGA AATACTCCCT GAAATACATT 7680

GCTGGCCTGC GATGTGCGTG AGGCATCCCC ATACCGGGAA GTTGATGTTG GCTGCGGCTG 7740 

GGGCTTGGGG TCCGGTCAGG ACCGTGTAGT GGACGGGCAT GTACGCCTGC ATCCTATAAG 7800 

ATGTCGTTGT TGACTGCGGC TGCGGCTGCG GCTGCGGGTA GGGCTCGGGT TTTCTGAGTA 7860

GACGAGGGTT CGTACGTTCG TCTTATTTCC GCGCGGGCAT ACTCAGCAAT ATTCTGCCTT 7920 

CCAtTCGTAG GAGCAGCAGG AGCAGCAGGG GGCGTGGCCT TTTTGTTCGT ACCGCCGTAC 7980

GTGCTGGGAG TTCCTGCTGG GGCCTTGCCC TGaCTGTCTT TGCCCTCCCA GTCGGTATGA 8040 

GGCAGGTCGC GTCTGTCCCT TAATGTGTGC CTCCTTGCCT GAGGGCTCCG GCGCCACCAG 8100 

TTTCAGATGC AATGCCCACG GCATCGCCTG AGAGGCTGAA CGGGTCTCCA CACTCACACA 8160 

GTCTGCGCCT ATGTCGCCAT TCGCTCCGTT TTTCTTATGC CCCATGGAGC GTGCGCAGTC 8220 

GTGCGTCTGC ATGTCCATGG CATTGGTGAG AAAGACGTCT TTAAACACAT TGCTGGCCTT 8280 

CGATGCGGGC GGACGTCCCA CACCGGGAAG TTGATGTCGG CTCCGGCTCG GGCTGTGGCA 8340

TAGCCGctAA CGCACGGGGA GTGCACGCGT TTTTACCATG TCACTTTCAT TCCGCAGACA 8400 

AGGGTGCCGA AGTGGCGTTC GGACCAGATG CTCTCGGCAA TGCCCATGTA AGGAGCGTCA 8460 

gcAAGCACGC CCTGTTCCCA CTGGGCGCTG AGCTCCACCT TCTCGAAGGG ACTGAACGTC 8520 

AgTCCCACCT GGTACTGGAG CGCTCGTTCA TTCAACAGGT TGCCCGCGGG GTTAATAATG 8580 

TTAAAGCGAT TGGTTGTGCC GAGCACGGAT GTGTGTGGTG CAAGCCAGGC GTGGGAACCG 8640 

AGGGGGATGC GATAgcTGcA CCACGCCTTC CCCAAAATTG GCATATTGAT AGTCCCAGGG 8700 

GGCACAGCTC CATTCAGTTC GTACCCTCCG TTATTTCTGT AACGGATGTA GGTGAGGGGG 8760

ATGTACACGC GTGCTTCGAC GCCGGCGTTC AGGCCGGTGA GCAGGTGGGT GTAGGGGTCA 8820 

CCGCTTTTGG TTTCGAGCTT AAGGAATCCG GCAAAATCAA AGTAGTGCGC ACGAGTGGTA 8880 
GCAAAGACGC GTTTGCCAAA GATATTAGTG CCTGCGGTGG CAAAGTATAT GCCAGAAGAG 8940

AGCCACTTCC AnTGCATACG CAGGAGCGCG TCTATGTTGA GCGCGTTCAT AGGTGCGCGC 9000 

TCAAGGAAAG CGAGAAGTTT AGCAGTGACA ACTCTTGGAT CGGAAGAGCG GAAGACATCA 9060 

CGTACTCCTT GCTCTATGTT CGGTACAAGT TGCGATACaA GCGCCGCGAG CGCGCCAGCG 9120 

GCTAGCACGG TTTGAATGGC GCTGCCGAGC GTTCCTTCTG CAATCAAAGC AGCAAGTCCT 9180 

ACCATCTCTA TGAGAGTGGT TTGTTCGGTG ATTCCTGGTG GCATCATGAT ATTGGGAAGG 9240 

TTCTGCACGA GTTTCCCCTC CACCCGTCTA AACACTTCCC TTGCTTTGAG GATAGCTCTC 9300 

TCTTGGGTCT GAGCATGTGC GTTACTCTGG TGTTGGTTAC CGGCGTCGAG GGCGAAGGAG 9360 

AAGCGGAAGC CGGCGCCTGG TTCGAGGGTG AGTCGGCCTC CTACTCCCCA CAGGAGTGCT 9420 

GTTTTGTTTT CGTTCTTGGA GTCTTCGGTA CCCTTAACGt AGTTCTGGTC CAGTGTGGCA 9480 

TTCCCTGCCA GCTCCAACGT AAGCAGCCGC TGACGGTCGA CGCCATAGGA AAGCGTTGCA 9540 

TCGGCCCCGA AGCCATACTT GCTGTGCGTG GTGTCAGTAC TATCCCAGGC ACCATTGGAA 9600 

AGGAAGGAGA GGAAACCGAT GTCCACATCT ACTCCGCTGT TTCCCACATT GTGGGCCTGG 9660 

TAGCCGAGTT TTGCCCCGGA GCCGGAGAAA CCAGGGGCAT AGCGAGTGTC CTTTTCTGAA 9720

TAGGCACGGG TGACAAAGGG TTTCCACAGC TGGGCAAAGT TAACCACACA GGAAGGACTG 9780

GTACCCACTG TCAGGTAGGC CCCATAACAG TGCAGGGTTG CCTGGAAGGA AGCGGTAGGT 9840 

TTGGTAAAGG ACAGGGCCGT TGAGCTTTTA GAAGACGCAA GCTCTACTGC CAGGTCCTTC 9900 

AGCTGCAGCT GTGCCCACAC CCCTGAGCGT GCCTCCCCTC GGCGGGTGTG GGTGTGCTTT 9960 

GACACCAACG GCAGGGAAAT AGTCAGACTA TTGGTAGTGC GAAACCCATG GGTGTGCTTG 10020 

CCCGGGCCAG TGCGTGGATT CTTCTGGAAC GCAATGCCCC ACTGGAGCTG GGCTGTGCCA 10080 

OTGAcTGCGG AGTGAGTACG CCTGCATAAC CAGAAGCAGC ACATACCATG CCCGCAAGTA 10140 

CCCCCGCTTG CATCACCTGC CTGCCCACTC ACTCCCCCTC CTCTCACTTC TACCTCACCC 10200 

CCCCCACCCG TCTAGCCGCG TGTGACTACC AGGAGAGGGT GACGCCGCAC ACGATGCGGC 10260 

CGATTCCCTG GGTGAGGCAC TCGGACACCA GCAGGTACGG GACATCAGAG AGCATACCCT 10320

GTTCCCAATC AAGGGAGAAT ACCGTCTTCT CTATGAGACT GGCTGAAATA CCAGCACGcA 10380

GCTGTGCACA GTACTCCTTG GTTAGATAGG TAGCTCCTAC TGCTCCACCT GCAGCAGGGG 10440 

CATTCAGGTG TGcACGGTTG GTAGAGGCAT GGACCGTAAC GCTTGGCTTC ACCCAGCCGT 10500 

AATCCTGCAC CGGGATGCGA TAGCTACACC ACGCCTTCCC CACCACCGGC AGGCCAATGT 10560

GcCCTGAGGA ACCGcCGGAA GGGAGAGGGT TCCCGTTATT ATTTTTGTAC AGGTCATGGG 10620 
TGAGGGGGAT GTACACGCGT GTTTCAACGC CGGCGTCCAG GCCGGTGAGC AGGTGGGTGT 10680

AGGGGTCACC GCTCTTAGTT TCGAGCTTAA GGAATCCGGC AAAGTCGCCA CAGCTTGCGA 10740 

TGGTGTTATC TAACACCCTG GTGCCAAAAA CGTTTGCCGG TGCTGTGGCA AAGTATATGC 10800 

CAGAAGACAG CCACTTCCAC TGCGCCGTAA ACAGCGCATC GAAGGCGACA TTGTAGGTGT 10860 

CAAGATACAG ACACACGGCG CTGACTCCCA TTAGAAAGGC ACGCCaTGCA GACGCACGCA 10920 

GGTTCTGTAT AGCCtGACGT ATCTGCTGCC CCGCATTCAA CGCATCCGTC TTCTTCTTCA 10980

CTTCTTCGGT TATGTGTTGC TGGACACTCG CGAAAAACGC CGTTATTTCC GTTCGCATCA 11040 

TCGGCACTAA ATCTGCCAGA TCCTGTTTCA CCTCATCTTG CACGCTCACC ATGGTCAGAA 11100 

TGCTGTTCTG cTTACGTGAT AGTTGCTGCT TAATGAgCGt TCGCCTACCA TTCTCGCAGT 11160

ATCGCCTAGA CTACCCCCGA CCGCTTGCAT GTTTGCATTA TTTTTTGCCA CAkCCGCTTG 11220

CACTTTCTGA TTAATTTCAG TGACGATTTG CGTCTGTACC TGCTCAAACC CCTTCACCAn 11280 

CCTGTCCGCA TCGTACTGCA GCAAAACCTG CCCCATCAGG GAAAATGCAG GAAGTGCAGG 11340 

AAgCGGCGGC AGGTTCGGCG GACTTCCTGC GGGGTGTGAA GATTTTGCAC AACyTTACCG 11400 

GTAGGTTTAG CAGGATTAGG CTGAACTGCC TCTAGCGCGT TTATGTACGT AGTCCCCCGA 11460 

GATTCCAGCG CGCTTCGAAC TCCAGCCGTT ACTGTCTGCG TCGCCTGTTG CACTACCTGG 11520 

GTTACCCAGG CTTCCTGTTT TTGACTTTCT CCCTGGAAGA GGTTATTTGA GAGGGCGGTG 11580 

AGTTCACTCT GCGCCCtCTG TGTGCGATTT TGAAAgTCCT GTGCACTCTG GTGTTGGTTA 11640 

CCGGCGTCGA GGGCGAAGGA GAAGCGGAAG CCGGCGCCTG GTTCGAGGGT GAGTCGGCCC 11700

CCTACATTCC ACAGCAGTTT ATCCTTGTTC TGATTGTTTG CGTCCTTCTG TGCACCGATG 11760

AGGTATCCGT CTTCTAGCGT AACATTGCTG GCAAGCTCTA CCGTGCACAG AGGGTGTCCT 11820

GCACGCGCAT ACATTAGCTT CAAGTCTGCC CCAAAGCCAT ACTTACTGTG CGTGGGGTCA 11880 

GTACTATCCC AGGCACCGTT AGAGGCAAAG GAGAGAAACC CCACATCAAG GCTGACCCCA 11940

CTGCCCCCAA TGTCCTGTGC CCGATACCCA ACCTTGCCGC CTAAACCCCC AAACCCCGGC 12000 

GCATACTGTA CCGCATCCTC CTGGTACTGC GCTGTCACCC ACGGCTTCCA CAGCCGGGCA 12060 

AAGTTCGTCA GAAACGTGGG GTTCTTCCCA ATCGTCAGGT AGGCCCCATA ACAGTGTAGT 12120 

GTCGCCTCTA CCTTCCCCTT GCGCTTAACG GCAAAACCTG CCTTCCCCTG ACTCAGGTCC 12180 

GCCTGCAGGT CCGCCACCTT CAGctCCGCA TACAGTGCCG GGTGCTGCCC ACGGCGCGTG 12240 

TGGGTGGTGC GCATAACCAG GGGAAAGGAT ACTCCCACCG TGTTGGTAGT ACGAAACCCG 12300 

TGCTTCAGAT TGTAGGGACC GGTGCCCATA ACTGCACCAG GGGCCTGGCC ATGACTGCCT 12360 
ACCCCCTTGC CATAGCTGAT GCCCCACTCA AGTGTGGCAG AGCCAGTTAG CTTCGGGGAA 12420

AACTCCTGTC CGAGCAcTCC CCgCTCGCTC CTACCCCCAC CACCACACAC AGCACATCCC 12480 

CCACCGCATG CACCCCATGm TACCTCACCC CCCCCCCCGg cCyTGTCTAg TAGCCCCcTC 12540 

ACCCTGCCAC cTGCACACAC GCAAAAACTC ACCACTCCTT GCACCTCCTT GTCCCCTTGG 12600 

GTTACACTGT GACCCCTTAT TTTGCGCATG TTTGTCGATA AAGGGGGGAG GATTCTTGTG 12660 

AGAGAGAAGT GGGTACGCGC GTTTGCGGrC GTTTTTTGCG CCATGCTGCT CATCGGCTGC 12720 

TCTAAGAGCG ACAGGCCGCA GATGGGAAAC GCAGGGGGCG GAGAAGGTGG TGAtTCGTCG 12780

TTGGAATGGT AACCGATTCA GGGGACATCG ATGACAAGTC CTTTAACCAG CAGGTGTGGG 12840 

AAGGTATTTC GCgtTCGCAC AGGAGAACAA CGCGAAGTGC AAGTATGTGA CTGCTAGCAC 12900 

TGACGCTGAG TACGTGCCTA GTTTGTCTGC GTTTGCAGAT GAGAATATGG GGCTCGTGGT 12960 

AGCATGCGGC TCTTTCCTTG TGGAGGCGGT CATCGAGACT TCTGCTCGTT TTCCTAAGCA 13020 

GAAGTTCCTG GTCATCGATG CGGTTGTCCA AGACCGGGAT AACGTTGTTT CTGCAGTGTT 13080 

TGGTCAGAAT GAGGGGTCGT TCCTTGTCGG CGTTGCAGGG GCGCTGAAGG CGAAAGAGGC 13140 

GGGAAAAAGC GCCGTCGGTT TCATCGTTGG CATGGAGCTG GGTATGATGC CTCTCTTTGA 13200 

AGCGGGTTTT GAAGCGGGGG TTAAGGCCGT CGATCCCGAC ATACAGGTAG TGGTTGAGGT 13260 

TGCCAATACC TTTTCAGATC CCCAAAAGGG GCAGGCGCTC GCGGCAAAGC TGTACGACTC 13320 

GGGCGTGAAT GTCATTTTTC AAGTAGCGGG GGGCACAGGA AACGGCGTTA TCAAAGAGGC 13380 

GCGCGATCGT CGTCTCAATG GTCAGGACGT GTGGGTTATT GGCGTAGATC GTGaCCAGTA 13440 

CATGGATGGG GTGTACGATG GGTCGAAGTC TGTGGTGCTT ACCTCCATGG TCAAGCGTGC 13500 

GGATGTCGCT GCGGACGGAT CTCAAAGATG GCGTACGATG GCTCTTTTCC CGGGGGGCAG 13560 

TCCATTATGT TCGGGCTTGA AGACAAGGCA GTGGGGATTC CTGAGGAAAA TCCCAATTTG 13620 

AGCAGTGCGG TTATGGAGAA AATTCGGAGT TTTGAGGAGA AGATTGTCTC GAAGGAGATA 13680 

GTGGTTCCGG TGCGATCTGC ACGCATGATG AACTAAGGGG GAGAGGTGCC TCCCGTGCCT 13740 

GCGCGCGGGA GGCCTTCTTC TTCATCTGAT TTTTGTTTGT ACGGCATGGC CGTCGTGAAT 13800

GGCTTCTGTG TGCAGGACAT TCCCTACGGG TCACGGGTTG TTTTGCCGGG GCGTATGCGT 13860 

TCTTCTTCTG CGGGTGCGTA GAGTGGGGCG TGTGTCTCGA CCCGCCCGTG GTCAGTGGGT 13920 

ATGGGGACGT CCAGTAATGA ACTTGAGGGA GGGGCTATGC CATACGCGGT GGAAATGCGC 13980 

GATGTAACTG TCCGGTTCCC AGGCGTTGTT GCCAATGACT GTGTTTCTTT CGGTGTGCAG 14040 

ACCGCGGAGG TGCATGCCTT GCTGGGAGAG AATGGTGCAG GCAAGTCTAC GCTCATGGGA 14100 
GTCCTTTTTG GTACGTGTCC GAAGCAATCT GGAGAGCTGT TTGTAGATGG
TGCATCCGTA gTCCGCGCGA TGCGcGCGCC ATGGCATTGG CATGGTGCAC 
ATCTGGTTCA CAATCTAACC GTTAGTGAGA ATATCGTTCT TGGCGTCGAG 
GCTTTGCTCG CACGGATGTT CGTGCTGCGC ATCGCCAATG GGGAGAGCTG 
ACGGACTTGC GGTgGACCCA TACGCGAAAA TTCAGGACAT CACTGTTGGC 
GTGTTGAGAT TCTCAAAATG CTTTACCGCG ATGCTCGGGT GCTCATTTTT 
CCGCAGTTCT CGCTCCACAA GAAGTGCAGC AGCTGATGCA GGTGATCAGA 
GTGAGGGTAA GGCGGTGGTG CTTATCACAC ACAAACTGAG TGAAATTAAG 
ATCGCTGTAC GGTAtGCGCA GGGGGGCGTG TATCGGTACG GTTTCTGTGG 
AGAAGAACGG TTGGTAGAAA TGATGGTGGG CCATGCGGTG GACTACGCGC 
TTCAAGGAAG GATGGGGCGT GTGTATTAGA GGTGCGTTCC TTGAGCGTGG
CGTTACGTCT GGGCAGATGT GGgctGACGC GTCTCCTTCT GCAGCGCCCC 
CGTCCGAGCC GTAAGCTTTC AGGTGCGGTG CGGGGAGATC CTGTGTATCA 
CGGCAACGgT CAGTCACAGC TGCTCGAAGC AATTGCAGGT CTTGTGCCGG 
TCAGATTCTG CTTGACGGGT GCGAGATAnC amACACTTCC GTGCGCGAGC 
TGGTGTCAGT TACATTCCTG AGGATCGGCG GAAGCACGGC CTTGTGCTCG 
GGAAGAGAAT ATGGTCTTGC GCTCGTATTT TCGCGCGCCG TTTGCGCGGC 
CGATCGGCGT GTGAtTGCGC AGCATGCGCA CGCGTTGGCG AAAAAATTTG 
CGGTGCGCTC GGGTGTGCGG TTCGTGCGCG TACCcTTTCA GGAGGTAATC 
TATCATTGCG CGCGAGTTGC ACCGTGCACC GCGTCTTTTG ATTGCCGCGC 
CGGACTTGAT TTGGGTGCGG TTCAGTATGT TCATCGCGCT ATTGTCGCCG 
GGGGGGTGCA gTGCTCCTCT TTTCCCTTGA TATGGATGAA GTGCTTGCAC 
TATTGCAGTT ATGTACGAGG GAGAGATAgT GGGGACCGTG CACGCGTGCG 
GCAAGAGCTc GGGCGTCTCA TGAGTGGGAT GCGGAAAAAA GAGACTGCGG 
CGGGGTACAG GGGTGATTGC GCGTCTCCGG GGGTGCCTGG TTCACCCCAA 
CTGCTTATTC CCTGCTTGGC GGTGATCTTG GGGTTTGCCG TAGtGCGGTG
TGTCAGGTTT GCACCCTAAA TACATTCTCA TAGCTTTGGT ACGTTCGATG 
ATGTACAGGC TTTTGGCACC GGCAGGTCCG TGTGGAACTT CAGGTATATG 
TGGTGACGTG TCTGCCGCTG ATACTCACAG GACTTGCGGT GGCATTTACG 



CAGGAGTGTG 
CAGCACTTTA 
CCTCGTGCGC 
TGCGAGCGCT 
ATGCAGCAGC
GATGAACCTA 
CGTCTTGCTC 
GCAATCGCCG 
CTGAGGTGGG 
TGCCTCGCGC 
GGGCGCGCCG 
GCGCGTACGG 
CCGGTGTAGA 
TGTCGGAAGG 
GTGTTCTGCG 
ATTTTTCTGT 
AGCAAAAGGT
AACGTAATCG
GCAAAAAAAC
ATACCACGCG 
GTAATGGCGG 
GATTGTTCAA TATCGGGGCA
TCGGTGTCCT TTGGCATGAG 
GAATGGTAGG GGGAGGACTG 
TCAGTGAGGT GGTGGTTACC 
AGTCACCGCT TTGCCTGGGA 
GTTGCACAGT GATTTTCTCT 
GCTTGTGATA GCTGCACTGG 
TGAGCTCCGT GTCGTTGGTG 
GCGGCGTGTG ATGCTTGCAA 
GCTAGCGATC GGTACTTTTT 
TGaGGGGATT GTGGTGTCCT 
TTCGCTGcTC GGTTCGTTGC 
aGGTGTCGGT AATTATCGTT 
GGGCGATGCT CGTCAGGTGG 
GGTGGCGCTG ACGCTTGTGT 
TTCCGAGCGG AGCGrGGTGA 
TTCCACTGCT ACGGTGACGG 
ACTGGGAGTT GGCAtGGcAG 
GCGCTTGTGC AGTGATCAGA 
ACGGTTTTTT TCGCACAGGT 
TTAGTTAAGA CTAGTTACGG 
ACCCATACGT ACCCGACAGT 
CTGTATCGCA CGCCTTTCGG 
GACGGTGCGG GCTTGAGTGT 
TGTGCGGGAC TTGGCGGCGG 
AGCACGCATG GGACGGGGTT 
TTCGGGGTAC TGGTGACAAG 
ACGAGTGTTG AGTTGTTGAA 
CTGACGGTTG TGGTGCTGcT 
GAAGGGCAGC
CACCTTTCTT 
TGGGGTTTGA 
ATTATGCTCA 
GCGACTTGAT 
CGCGTGTAAG 
TTAGCTTTAA 
CTAGTGCCGA 
CGAGCATTTC 
CGTACGGGCG 
TGGTGGGCCG 
GCGCGGCAGG 
TCGGCGATTA 
GGGAGGCAGG 
TTTCAACCCC 
TAAATATTGC 
TCCTGTGCGA 
TTGCCGCGTC 
TCATCGCAGG 
TATTTTTGGG 
TTTTTTCAGT 
ATATCTAGGT 
TGTGCACGTG 
GTTTCGTCTG 
GGTGCTGATA 
TATCGCACTT 

ACATTTGCCT 
ATTGTTTGGC 
TCGTAGTCGG
TCTTTACCAT 
TACCAGGGGT 
AtACGTAGGA 
GCGTACGGTG 
CAATGGGTCG 
GTTTCTCATT 
AGCGGCCCGC 
GGGTATGTAT 
GGTGTTACCC 
TAATACGGCG 
CCCACTCATG 
TTGTCTTTCT 
GTGCGCACGT 
TATTTTGATT 
CCTTGAAGGG 
GCCGTATACG 
GGTGGcGTTG 
CACTGaTnCA 
CAGCAGGGAA 
CGCATTCCGG 
TTTGTGCTAG 
CGTGCCACAG 
CGGTCTGCGG 
CTGACGCAGG 
GCAGCCTTGA 
GGCTTTTCAC 
ATCGAGCTGT 
GGCCGTGGGG 



TAGCGTGTGC 
TCcTGCGGCG 
GTTGCGCGCA 
CTGTATGGGG 
TCTTTACCCC 
CGTCTGCATT 
GAGAAAACAA 
TATGCGGGgA 
GCAGGGCTCG 
GGATTTGAAG 
TGGGGGTGTG 
CAGTTGAACG 
TTCCATGCAC 
ATGAACACGT 
ACTGCGTTGG 
TTGATGATGT 
ATAGCTGCTC 
TTTTACGCAT 
ATTTGTGTGC 
CGCAGGCGTA 
TGCTTGGCCC 
TAGCATTGGC 
GGGATCAGCC 
CGGTGGTAAT 
ATATCCAATA 
TTTCAGGACG 
AGATTTTGAA 
TTAGCGCGCT 
AGGCACCGCG 
GCAGTGTGTG 15900

GTTCTTGCAG 15960 

GTGTGCGGGA 16020 

CGAATTTTGT 16080 

CAGCTGCGAC 16140 

GGGGCTTTTT 16200 

CGTTCGGCTA 16260 

TTCACATCAG 16320 

CCGGTGTGTT 16380 

GGTATGGATT 16440 

TGTTCGGCGG 16500 

GAtGCcGAAG 16560 

AATGGAATCC 16620 

TTTATTCGAT 16680 

GGGGGTTGTT 16740 

TTGGTGCTTT 16800 

CGTGGATTGC 16860 

ATTTGAGTGT 16920 

AACAGGAATG 16980 

CTCTCGTGGG 17040 

GATGGTTTTC 17100 

GTGGTACGTA 17160 

GTATGCAGTA 17220 

TTCAGGACTC 17280 

CACCGTCTAC 17340 

GTGGCATCCT 17400 

CGTGTATGCC 17460 

TCCGTACGCG 17520 

TGCCATCGGT 17580 
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CAGCCCTATG ACCGTGCGCG G AGG T ATT AA TTTTTGGAAG AAGGgTGAGT TGGGTGGCAC 
CTATTCTTTG CAATGAGCAC CCGGGTAGGG GGTTCTGTAT TCCGGGCGCG TTTTTTGCTA 
CACTTAGCGG GCTGTGGCTG TGCTTCCTTC GGTGCGTAGG TTTGAGTGCT CACTGTTTGT 
GGTGCTCGTG CTCTGCGCTC TGGCCGTCTT CGATCCGCTT TCTGGCTTTG TGCAGCAAAA 
GTTGGCCGGT GTGCAGCGCG TCTGGCTTGG CTTAGTTGAG GAGTATTCAG GTTTACGTTT 
TCAGTATGAT TCTCTCTCCC CTTCTGTTCT CCGCGCAGTT ACGCTGAGAA ATGTTCGTGT
TCGGGAAGCA GTTCGCGGTG AGCAGGTTGC CGTCTTTTCA AAAATAGTCG TTGCGTACAA 
TATTTTCTCG CTTTTTGGTT CCAACCCTGT GCGGGGTATT CGGGCTCTTC ATGTTCATGA 
CGGAGCAGTG GACGTCGACC TGTACCGTCA CCGTCATGTG AAAGAAAAGT TACAAAAACT 
GTTCTCGAAA GACGGGGAAA TGGCTTCGTT CTTTGCCGAT TTGCGCGAAA TAGACGTGCG 
CGTCCATAAC ACTGCAGTTA CGGTGCGCAg CGATTCCAGA CGCGCGCACC TTTCTGTGCC
GCAGGGTAGG TTTTCTTTTG CGGAAACTGG CGCCTCGTTC GCTCTTTCTT GCGAAGCTGA 
GTATGTCGAC ACCCGTTCCT CTTCCTGGGG ACCGCTGTAC ACACACCTGG ACGCCTCAGG 
CGTGTTTGAA ACGTCGTTTA CGTCAGGTTC CGCCACCCTC GAGCTTGCAC CCCCGAGCGG 
CTCTTTTTTC AGTGTGCCGA CGCTTACTCT CGTGGCAATT TACGCAGATG ACCTGTTTAA 
GTTTCACACG GCGCGGGGCA TCTACCCTAT GGAAGTTTCT GGGCAATGGA ATACTGCAAC 
CGGCGCTTGT GAAGCTTCCG TGCGCTGTGA AAATTTTCGT CCCCTTAAGT GGGCGCGGct 
CCGCGACACC CACGTGCCAG CACAGGGTAT GCAGGAATTG TCTGTGAGCG GGAACGTTCA
GGTTGGGTAT ACCCCCATAG AACAGTGGCG GTGGAGTGCG GATGTGCACG CGCACACCCC 
GTATGTAGTG ctTGCGCCGG GGTATCAGCT GGAAGACGTT GTCGCAACGT TACAGGCGCA 
CGGTGATCCT GCACGGATTC AGGTAGAAAA GATATGCGCA CGAGGTAGTA ATCTTGATGT 
GGACGGTGCG TTCGAGcTCA CGCTGGACCG CTGGATCCCT TCAGGGGTGC TTACGGTGCA 
CAGGCTGCCG CTTCTTTCGG GGGCATACCT TTCAGCGCAG tGCGTTTTCG CCCACAGGGG 
GTTGGTTTTG TGTGCACCGT CCCGCGGATA CAGGTGGGGG AAGCGTTTCT GGAGGACGTG
GCGCTCTCAG TACGTGTGGA TCCGGCAAAA ACGGATTTCC GCCTGGTGGC TGCAGACAGC 
ACGGGGCGCT ACGAGTGTGA CGGATCATAC CTTGCCGCGA ATGnGGGGCA GTCTCGCTTT 
CTTGAGGCAC ACGTGGCGTT TGAATCGGTG AATGTCGGTG CGCTGTACCA AATGGTTGCT 
GCCTGTACGT CACCGCAGGC GCTTCCACGC TCGTGACGCG CGCACTGGTG CCGTTACAGT 
CAACAGCAGA TTTTTACGTT TCAAGTGATT TTCGTGATAT TTCGTACAAT TGTGTTCGTT 
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TGGTGCTCGC ATCGGATGAA ATCGCTGACC TGTACGCCCT GCTGTCAgTG CAGGGGACGG 
CAGCTTCCTT TTCGGTCACG GATATTTCGC TGCTGTGTAA GGGACTTGAG GTACAGGGGA 
ACGTGATGGC GAATTTTGAA CACGGGGGAG ACGCCCTCTT TGAAAGTGTC CTATCCATCA 
ATTCGGTTCC GTATCGTACC AGGGGAGTAT ATGCCGACCG TACGCTGACG GTGTATGGCG 
ACTATGATTT TTCGGTGGTG GCATCGTTTG ACGAGCGCGC AGGGgTTACC GGCACGTTTC 
AGGTGCAGAA TCTGCCGGTT CCTCTCTCTC AGAGTCTTTT TGATTGTGAC AGTTCTTTTG 
CAATGCGTAG TGCCCACTCG TGGGAGGTGC GCTTTCATCA CCTGCACCTC CGTTCTGGGG 
CGGTCGCCGC AGtGGATCGG AGCAAATAGA AACGGTCTTG CGCCTTGCTG GCGTGGCGAA
CCAGGCCGGT GCTCTGTTTG ATCAGGTGTT TTTTGGTTCT CGCGATCGGT ACTTGGCTGG 
AACGGCGAGC TTTGCCGTTG TGCCGAGAAC AGGGCAGCAC GAGCAGGCGC GGTATGAAAC 
GGCCGTGCGC CTTGCATCTG AAGATGCGCA GGAGCAGGTG CAGCTTAACG CGCAGGTAAC 
CGTGGGGGAA CACGTCTATG TGGATAGCTC AGGGCGAATA GATAACGTAG ACGTGGGGCG 
TTTTGTTGCA GGGCAGGGGG AGCGCAGTCG CGTCACCGGG TCGTGGACTG TGCTGGGTAC 
GATGCAGGAT ATGTCTGGAC AGGTGCAGGT AGATTCACTC GAGCTGATCG CCAAGGGAGT 
GCCCTTTCAC CTGCGGGGAG GATGTGCACT TGATGACGGT wcgCTTGCGC TTTTGCCCAC
CCAGGTGACG TGGGGGTCAC ATCAGTTTGC TGAcCTTGCA GGAGAATGGG TGCCGGGTCA 
GGCGCGTGCG TGGGTGCGCA CCACGTACTC AGGCGCGTTT GAAGGGCAGC CGACACATGC 
CACCTGTACG CTCACCCTTG CCGGATCCCC TGTGGATTCG GGTAAGGCGA CATCTGCAcT
GCGCACGTCG TTTCTCACGC CATTTTTGCA GACGCACAGT CAATACACGA TTTCTGCGGA 
GTTTGAGCAC TGGCGCATCG CCACATACGA GGGTGAAAAG AACCGCATAC TGGTAGTGCG
CGATCCGGGC GTATGGGCGC TGTACGCCGG TGAGCACGAC GAAATTACCG GATTTATGCT 
GGATGATGGT TCAGTGTCGT TGCAGGTGGC GCAGAGTTTG CCTGTTCATT TTTTCTTGAA 
CGGGTCGTTG AGTGCACAGC AGGTAGACGT GCAGATTCAG GATATCTTTG TTGATTTGGC 
GCGCGTATGG GCGTTTACGG GCATACGGCA TGTGCGCGTG CACGAAGGAG TTGCGGTAGG 
AAACGTGACG GTATCTGGAA ntnCGTGCGCG CCCGGTGTTT GAAGGAAAGT TACGGGGAAA 
GGAGGTAGTT GCCAGCGCGC CTGGGTATGC ACCTGAGCGC TTTGGGCCAG GTTCTATCGA 
TATAGTAGCA CACGGCAGCA CGCTCATAGT GCCGTATACA GAGTTTCCCG GTCCGACGGC 
CCGTCTTTGG GGTGAGTGTG TTGCACAGCT GAATGGATTT TACCCGGATG AGGTGGTTAT 
CAAATGCGGG ACGGTAGGAG ACGCGCTGGG TGCGATTCAG ACGGATAACC TGCTTTTTGC 
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GATGGACGGG TCAGCCGGCT 
AAACGGAAAG GCGCGCTTTG 
GTTTTACACA AAATACGCGG
GGGAAATAAA GTGGAGTTTC 
GCACGCGCAG GAACCGTTTG 
CGGGTTTGCT CATCTTAAGG 
GGAGGGGACG ATTCACTTTG 
TGCGGAGCTG AAGGACAGGG 
GGACCAGGTG TTTTCTAAGC 
AGAACTGGCG AAAATTTTAG 
GCAGAACGTG GCGAGTATCG 
GGAGGATAAA ATCCGCTCAT
GCAGAACGCG ATTTTTGGGA 
TAACTATTTT GACAATACCT 
GGATGCGCTG CTTCACTTGT 
AnArCtGCGG CAGGGAGTTT 
TTTTTTTCGT TGCGGTGGGC 
ACTTCAATGC GGGTGTCGTG
AAGGAATGCT CAAAAAAGCC 
CGTGGGCACA GGCAAACGAC 
AGGGGCTCGA ATATATTGCT 
AAAAGTGGAC CTATGAGCTG 
TTTCTGAAGT TTCGCCTAAG 
AGTTCACGGT AAAGGAGCGT 
TCCGCAGTGG GGACCTTTTG 
TAAAGATGAA GGTGGACCAA 
CGGTTAAGAT ATCCTGCGAG 
TCCAGGAAGG TAAGCAGACT 
CCGAGTCGGT GCTCAAGAAG 
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GCGATCTGGA 
ACCGCGGGTA 
ACAGTGCGCA 
GTTGGCCGCG 
AGTTCATAGC 
GGGGAGAGTT 
CACGCGATAC 
ACACGCAGGG 
TTGCGCCAAA 
GACAGGTGGT 
CTTCAGATAT 
TTTTGGATTT 
ATTTGTTCAA 
CCCTCTACGT 
CTCAGTATGA 
GCTGTTCCGG 
GTCGACGCCG 
GAGTTTTGCG 
aGTGCCTTCC 
AATTGGTACG 
CGCGGCCAGT 
TACCTGGAGA 
GCGGTGCCCA 
CCTTCGGTGA 
TCTAAAATCC 
GAGTCGCTCA 
GCAAAAACTG 
GTTGTCTCGC 
GTGCTTTCCA 



W13041 

GCTTCGTATA ACGCCGCAGT TGTTATCGAT 21120 

TTTTTTACTG AATTTCTCAG GGATTGAAGA 21180 

GAATTTTCAG ATGAATCTTT TGCTGTCTGC 21240 

CTCTGATTTT CCTATTTTGC GGACGCTGCT 21300

CGATCCGGTT TCCGGGTCAT TTTATGTTCG 21360

TTTTTGGATA AAACGAAACT TTTACCTCCG 21420 

CCAAACGGCC GATCCGCGTA TTTCGTTTCG 21480 

GAGGCCGGTG AGCTTGATTT TGTCCGCTGA 21540

GCTCAGGTGT GATCCGCCGG TTTCTGAGCA 21600 

GCTGGGGGAT TTGACAGAGG AGAATATTGA 21660 

TCTTACGCAg TGGGGGATTA TGAAGCGGGT 21720 

GGACGCGTTT TCGTTCCGCA CCTATGTTCT 21780

TAAGGACCGC AGCAAGCCGC TGACAGTGGG 21840 

AGGGCGTCGT CTTGGCCGGG CGGTGTACGC 21900 

TCCGcTTGCG CCAAATAATT TGGGGATTAA 21960 

CCGGAGCTGG GGCTAGAGTT TGCAACGCCC 22020 

ACACGTCTTG ATTCACTGTT TGTCTCTGAT 22080 

TATTGAGGCT CAAGGCAGCT CGTAAGGAGA 22140 

TAATTGCAAG TTGTTGTGTG ATGTCGCTGG 22200 

AGGGAAAGCC TATCTCTGCG ATTAGTTTTG 22260

TGGACACGAT TTTTTCTCAA TACAAGGGAC 22320 

TACTGCAAAA GGTCTATGAC CTTGAGTACT 22380

CCGATCCGGA GTATCAGTAT GTGATGCTAC 22440 

AGGGCATCAA GATGGTAGGG AACAGCCAAA 22500 

TCCTGAAAAA GGGAGACATT TACAATGAAG 22560

GGCGTCATTA CCTGGACCAG GGCTATGCGG 22620 

AGGCgGGGGG CGTGGTGGTA CAGTTTACCA 22680 

GGATACAGTT TAAGGGAAAT AAGGCGTTTA 22740 

CGCAGGAGGC GCGTTTTTTG ACCAGTGGGG 22800 
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TGTTCAAGGA GAATGCGCTG GAAGCGGATA 
GGGGATACAT TGACGCGCGG GTAGAAGGCG 
CCAGTCGCAA TCTGGTTACG CTTACGTACA 
GCGGGGTTAC CATTGTGGGT AACCAGATTT 
GGCTCAAGCG CGGGGCCATC ATGAATATGG 
CGGATGCGTA TTTTGAAAAC GGATACACGT 
ACACGGCGGA GAAAACGCTT TCGTTTAAGA 
TCGAGCACAT TATCATTaAG GGAACGAaGA
TGCTGCTGAA ACCGGGGGAT GTGTTCTCTA 
GTTCAACCTG cGCTATTTCT CGTCGCTGGT 
CCTGGTGGAC ATTATCCTGA ATGTGGAGGA 
GACGTTTTCT GGGGTGGGGG AGGCAGGCAC 
AGAAAAGAAT TTTTTGGGAA AAGGGAATGA 
GGCGCAGAGC CTGAAGCTCG GGTATGTGGA 
GGGCTTTGAC TTTGAAcTTA CGCACAAAAA 
CAACGGGcTG CCGCACCCgT ACACGAGCAG 
AGAATCGTTT CGCCTCAAGT ATTCGCGCTT 
CCAGTGGTAT CCGCGCTATG CGGTCATTAG 
AAAGAATTTT TACGATAAGG ATAACAATCA 
GAACTGGACC AGTATCAATT CGTTTTGGAC 
GTACGACCCG TCCAGCGGCT GGTTTTTAGG 
CTTTCTCGAA AAAGAGCATT CGTTTCGCTC 
GCTCAATTAT CCGGTCTCTG CCGTGTGGAA 
TGTGTCCGTT CAAACGTATT ATGGACGGAG 
GCGGTCCGGC GCGCTGGTAA TAGACGGCGT 
AAAGAAAAAC ACCGGAGACC TGCTGCTCCA 
GCACGGCATT GTGTCCTTTG ACTTTTTCTT 
TCAGTCCCCA AACGGGTCAT CGTCCGCCAG 
TAGAACCACC AGCTCTGAAG GACTGTACAA 
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AGGCGGCAGT CCACTCATAC 
TGGCAAAGAC GGTTGATAAA 
CTGTGGTGGA AGGTGAGCAG 
TTAGCACCGA GGAGCTGCAG 
TGGCCTTTGA GCAGGGCTTT 
CAAATTACCT GAACAAAGAA 
TCACGGTGGT GGAGCGCGAG 
ATACAAAAGA CGAGGTTATC 
AGTCAAAGTT TACGGATAcT 
GCCGGATGTG CGGCCCGGCT 
GCAGTCGACG GCAAACGTGC 
GTTCCCCCTT TCGCTCTTTT 
AATTTCAGTA AATGCAACCT 
GCGCTGGTTT CTGGGCTCTC 
TCTCTTTGTG TACCGCGCGG 
GGAGCAGTGG GCTAGTTCCC 
TGAGTCCccC ATCGGCGCGC 
GGTGAACGGG GGGGTGGACT 
GCCCTTCGAC CTGACCGTAA 
GAGCGTTTCG TTTGACGGGC 
ACAGCGCTGT ACGTTCAACG 
CGACACCAAG GCCGAGTTCT 
CTTAAAGTTT GTCTTGGCTT 
GAAAAGCGAA AACGGAAAGG 
GCTGGTAGGG CGCGGGTGGA 
CCACTGGATT GAGTTCCGCT 
TGATGCGGCA ATGGTGTACA 
CAGCTCCAGC AGCAGCAGTA 
AATGAGCTAC GGTCCGGGGC 
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TATGCAGAGA 22860 

AAAACTGACG 22920 

TACCGCTACG 22980 

GCAAAAATTA 23040 

CAGGCGCTGG 23100 

GAACACCGGG 23160 

CGCAGCCACG 23220 

CTGCGTGAAA 23280 

TGCGCAATCT 23340

CTGAGCAGGA 23400 

AGTTTGGGGT 23460 

GTCAGTGGGA 23520

TGGGGTCTGA 23580 

CGCTGACGGT 23640

GTTCATACGG 23700 

CTGGGCTGGC 23760 

ACACCGGGTA 23820 

TTCGGGTTGT 23880

AAGAGCAGCT 23940 

GTGACTTTGC 24000 

GGCTCGTTCC 24060 

ACGTf ACCCT 24120 

TCTACACCGG 24180 

GCAACGGGGT 24240 

GCGAAGACGC 24300 

GGCCGCTGGC 24360 

ACATCGAAAG 24420 

GTAGTAGCAG 24480 

TGCGCTTTAC 24540 
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ATTGCCGCAA TTTCCGTTAA AATTGGCGTT CGCAAACACC TTCACGTCAn CCGGCGGCAT 24 600 



CCCAaAAACa AAGAAAAATT GGaATTTTGT GTTGTCGTTC ACGGTAAATA ATTTGTAGCG 24660 
TTCCCGTGnC CGTTTTGAAA nGGTCCGGGG GCTGCGTCC 24699 
(2) INFORMATION FOR SEQ ID NO: 27: 



(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 4637 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 

TGCCATGGAA ATGTGACCTA CGCCTCCCTT TATGGGCCAC AGGCTGGGAA TGTCAGAAAA 60 

AAGTGCTGTT TGCGAATTGA GCAACTTTCC TATCTCCCGT ACTGGCTGCA CAGAGTTCTG 120 

AAAATAGGCG GCTATGCGCC GCAGTTCCTC TGGCTCCGAC CCAACGCCCT GCGCGTTTTT 180 

GCCATGCTGT TCTGTCACTC CAACGTTTCC CTTTTGTGCT CCATCCTTTG CTGACGCATC 240 

ATCCGGCTCG TTAGACTGGT TTGTGTCGGA TACACTTCGT ACTGGTTTTT CTGCCCCAGA 300 

TTGGAGCCCA CTGAGGCCAA CCTGAGAGAA AGTTTGGGAA AGGGAAGTCT GGAACGCCTT 360 

AGAAGTGCGA ATAAGTTCTG CTGCTTCTTG GCGCAGGAGT GCAAGGTGCA CCTGCGTTCT 420 

GTCCACCTCT GCGCGGACCG CCGCCAGGAA CgTGCGGAGG TGACAGACTG ATGGGCAAAC 480 

CAAAAAAAAG AAGCCAACAC GCCAACACCG CACAGAAGAA CGCACAAGAG CGTGCGAGCG 540 

GTAGTACAGA AAGTCCGGGC CGCACACTGA GAGTGTGGTA CTAACATGAC GGTGAGTTCT 600 

GCACGGCCTG CGGCGCCCAG GCGCGCTGCG GCTTCGCGTA CAAGCCGCGC ACAACGCGTA 660 

CACCGGTCAA TACAAAAACG AACGAACGCG TACTCAATAC GCTTGTATCT TCGGATGCGT 720 

GCCATGCGCA AGACCTCTCC GGGGAAAACG GTATCACCGC GGTTAGATAC TGTCAAACCG 780 

TGGAACTACG GGACGGTCTG AGCACGAGGA CGCGGGCACC CAACCCAAGC TTCGGTTCTA 840 

CTTGCTCTTT TCTTTAAAGA GGACCAAGAG GGCACACGAG CCCCAACCCT GGCCAGGAGC 900 

GAGCACTGGC TTGGCCCCCG CGCTGAGGGA AAAGCGGGAG GACTTTTCCG TGCTGCTATA 960 

GCTGATTGTT GATATCAGCC AACGTTTTGC GCAGGGCAGG GTTCTGCGCG TAgTAGTACT 1020 

GCTCGAGctC CGCTGCAGTA AGCCCAGAGA GCTCGTGGCT CATGTAATGG AACCACTCGT 1080 

TATCTGCCTC GCTACGTACC TCGTGCTTGA TGCACCAGTA GTCCGGCAAC ACCGGGGGCT 1140 

GTTTTTCTCC CACGAATACT TTGTAATTGA AGACAGCCAC GCGGATGTCG CCACGCGCAA 1200 
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GGCCCTTTTG CATAAGCAGG GAAGCGATGT AGTTGATGGT TGCGCCTGAA TCGAAGATAt 
CATCTACGAT GAGCACCTTA TCCCCGACGC GTAGGTACTC AGGAGGGTAG GTCCAGCCAT 
CTACGCTGAT GAcGCGCCss TTACGCAAAT CACAGTGCGA GTGAGCAACT ACCGCTGCGT 
ACAGGATAGG AGGCTCTGCC TTGTACGCGA TGGTTAAATA CTCATTGAGC ACGTTACCCA 
GATATACTCC ACCCCGTATG GGGACGTACA TAACCGTTGG CACGAACCTG TCTGCCACGA
TGCGCCGGGC CATACCGAAA CCCTCATCAC GGATCACATT GTACGGAATA AATCGCTTTT 
TCACGTTAGC CTCTCCTGCA CGAGCACGAA AACACCCTAC ATCTAATGCT TTTTTAGCAT 
CATGGCAAGC TCTTTTTCTA TTCGTGTCGT GGCCTGGAAC TGTCTTTGTT GAaAGTTCGC 
CTGAATATTT TATGCTCCTG CGCGAGGGCC CCCGTGATAG AAAAGTTGGA AGAACTGCGC 
GCTCAGTGGA GAAAACTACA GCAGGAAGTG GAGAATCCTT CGCTTTTCTC TTCCACTCAG 
AGTTATCGTG AACGTATGCG CGATCACGCC TATCTTTCCA GACTGATGGA AGAGTATGAT 
CGCTATTTGC TTACTGAGAA GCAGTTGGaA GACGCGCACG TTCTCATCCA AGATGAGTCG 
GATGCTGATT TTAAGGACGT TATTCGGCAA GAGATCCGTA CACTTGAAGC TGCACTGCAC 
ACGAGTCAAA AGCGACTAAA GACGCTGCTT ATTCCCCCCG ACyCTTTGCA AGAGAAGAAT 
ATTATCATGG AAATTCGCGG CGGTACCGGC GGTGATGAAG CAGCGCTCTT TGCTGCAGAT 
CTATTTAGAA TGTACACGCA CTACGCTGAG TCAAAACAAT GGCGCTATGA AGTCCTTGCA 
GTGAGCGAAA CAGAGTTGGG AGGATTTAAG GAAATTACGT TCTCTATCTC GGGGCGCGAT 
GTGTATGGCA GTTTACGTTA TGAATCGGGT GTGCATCGCG TTCAACGTGT CCCTAGCACT 
GAAGCGTCGG GGCGCATCCA TACCAGTGCG GTTACCGTTG CAGTGCTGCC TGAGATGGAA 
GAGACTGAAG TGGACATTCG TGCTGAGGAC GTGCGTGTTG ATGTCATGCG TGCAAGTGGT
CCTGGTGGGC AGTGTGTCAA CACCACTGAT TCTGCGGTGC GTCTTACACA TCTAcTACGG 
GCATTGTCGT TGTCTGTCAG GACGAGAAGA GTCAAATCAA AAACAAAGCC AAGGCCATGC 
GTGTATTGCG CAgCAGAGTG TATGATTTAG AGGAATCGAA GCGCCAGGTT GCCCGTGCAA 
GGGAACGCAA AAGTCAAGTT GGTTCAGGGG ATCGTTCCGA GCGCATTCGC ACGTATAATT 
TTCCTCAGAA CCGTGTTACG GATCATCGCG TGCGTGTTAC GCTCTACAAG CTAGATGCAG 
TGATGCaGGG TGCGTTGGAT GACATTATCG AGCCaTTGTG TATTGCGTCT CGAGAGAGTG 
TAATCTAGTG CAAGAACTCT GTACGATTCG ACAGGCGCGT ATGTACGCGC GAGCGTTGTT 
TCAAGACGCC CCCTGTTTGC GCGGACAGAA CACACCGCTT TTAGATGCAG ACCTTATTCT 
GTCgAAGTTG cTTGCGAAGC CGCGTGCGTG GATTCTCGCC CACCAGCAGG ATGAGATTGC 
CTCCGTTGCA CACGAGTTTA AGCGTCTCGT GCATCTTCGT TGTAgGGGAC GTGCGTTGGC 3000

GTATCTGACT CGAGAAAAAG AGTTTTTTGG TCTGAGATTC CGTGTCACCC GTGTACGCTT 3060

ATCCCTAAAC CGGATACCGA ATTGCTTGTA GAAAGTGTCC TGGCGCACGT TGCGTCCCAA 3120

ATGATGAAGC CGCGTTCAGT ATCTGTGCAT AAAGACACAA GTGCACTGCC TGTCTTGAAG 3180

ATATTCGAGG CGTGTACGGG ATGCGGGTGT ATTGCCATTG CACTTATGCA TATGTTGCGT 3240

GCGCtGGCAC GCCACCTCTC TATGTCATTG CATCCgACAT TTGCATGCGG GCCcTTGCCG 3300

TAX S GCGGTA TAACGCGCGC CGACTCTTGG ATGTATCTGC AAATTCGCGC GTAcGTTTCG 3360

TGCACGCAGA TGTGCGTGCT CCTATTCCGT TCTTTTCTCC TTCTGAAGGC ACGGACnTGG 3420

TACAGGAGCG CGGGGTGTGC GTTCCGTATG ATGTGATATG TGCAAATCCG CCTTACtACC 3480

GAGTGCGCAA GCGCGCGCGC TGTTGCAGGA CGGGAGAGGG GAGCCTCTCG GTGCCTTAGA 3540

TGGGGGTGCA GATGGGCTAG ACTTGGTTCG CGCATTCGCA CACCACAGTG CCGCAGCGCT 3600

AAAGGAAGGC GGGTGCGTGT TTTGCGAGGT CGGCTCAAAC CACGCACAAC GTGCAGCGCG 3660

CATCTTCCAG GCAGCAGGGT TTGCCACGGT GAAAATTTCA AAAGATCTCT CCGGGAAAGA 3720

GCGCCTGATT AGCGGGATAc TGCGCTCGCA GTCTAGAGCT GTAACAGCGC CGAGTGGCTA 3780

GGGTGAAACA CGGCGACTGA GTGGTTATCC TGGCGTTTGC AGGTGGATGT nCGCGCCGCG 3840

TTGGCCGATA GGCTGAGTAC ATGAAGGAGT TAGAGATCAT CCACCATTGC GGATGACTTg 3900

CGTACGsGrT TGATTTTGCT TCAAAAAAAT CGGTTTTAAT CAAGTTTGCG TTGCTGTACT 3960

GACTTACCCA GCTCATCGAT TCCGGTTCTA CACGGTGCCC CTCGTACAAG GGCTCAAAGC 4020

CTAAATTTTC GCAACGAAGA TTACCCAAAT ACCGGATATA GTCTGCCACC ATGTGGCGAT 4080

TTAGTCCAGG GATCTGATCC CCAATGACAT AGTCCCCCCA CTTAATTTCT TGTTCGCATC 4140

CTTCGCGAAT CATATCGCGA AATAAGCGTA CATTGCGTGC AGTGAACACC TGwGGCTCTT 4200

CCTTTTGCAG TTCTTGAATA ATGGATCGAA AAAGCCACAG GTGTGTGTTT TCATCGCGGT 4260

TGATATAACG AATTTCCTGC ACCGAGCCGG GCATCTTGTT ATTACGCCCC AAGTTATAGA 4320

AGAACATAAA ACCCGAATAG AAATAAATTC CTTCCAAAAC ATAATTCGCA ATTGCTACCT 4380

TCAGCAGTGC GAGTACGCTT TTGTCATCTT GAAACTCGTT GTACAAGTTG CCAATGAATT 4440

TATTGCGCGC AAGCAGGATG CTCGTCGTCC TTCCACTGGT ATAGAATGTC ATGCGTTCTT 4500

CGGGGGAGCA AATGGTGTCC AGCATGTAAC TGTAACTCTG CGAATGCACA GCCTCTGGAA 4560

AGCCTGAAGG TTAGGCACAG TTAATCTCAT TGCGGTAAGT ACTGACCAAT ATTGGCAGAT 4620

CGCAGTCTGG GATGCTA 4637
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(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10820 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 

TGTAGACGGG GCACTGAGTG CTGAATGCGC AACGTCTCCA CGAGAGATTC AGAAGGACGC 60 

ACGGGTCATG CCCCCACCAA GAATCGTGAA ACTGTTTTTC CTGTCACGCG CAGAAGCATG 120 

CCGgCTTATT TTGGTTCTAA CGAACATTTA TACGCACGGC AAAGAAGTGG GTGAGCAGTG 180 

CCACACCTCC CGCTCCTGCC AGCATGGAAG TGTATACCAC TGTACGCGCA TTGAAAAAAC 240 

AAAGTGTCAT GACTGTTGCA GAAAAAGCAC CCATGAGGAG GAATATACAC AACGCAGCGA 300 

TTAACCCGAG GAGTGGCAAG CGCGTGCGCA TAGAAAAAGA AACATGCGCG TGTAACTGCA 360 

GCAGGTTAGT CATGTACAGC ACGAGTGTCA TGACACGCGC CAGAAGGTCA GGATCATTAA 420 

GTTGTGTATA CACGTGCACA AAGAAAAACG ACGTATACGC GCCGAAGAGC GCAGACACGC 480 

CCGATACGAG GGATACACGC TCAAGTGAGC GAGAAGTAAA GAGGAAAAAC GGCAGGCAAA 540 

AGAGGGGAAA AACATAATCA AAAAGAAAAA AGCGCATCCA CTGCTCTTCC ACAAGGGCGG 600 

AATCCGGCGG ATAGTACCCG AGAAAAAATG AACGAAGTAG GAGGAGAGGC ACAGCAAGCA 660 

CGGCGCCGTG CACAAACGAA ATAAGTTCCT GAAGTGGGTC CCCAGCGTCC GCGAAAGAGG 720 

AAAAGAACAA AAGAGGCAAC GAAATAATGA GAAATATTTC CACTATCGAT ACCGTAACCA 780 

ACGGGGCACC GCATACGGCG AAAGATCACC GCGGGTATGC GTTCGCCCAT CAAAAGTGCG 840 

CCGCTTTACA GCACAAGCTC GCTTGGAATA TCAGCGTTAC TGTAGACGGC CTGAACATCC 900 

TCTTCTTCTT CGAGCCGGTC AATCATCTTC AATACCTTAC GCGCAGTCTC CTCATCAAGC 960 

GCCAGGTACG TGTCGGGAAC CATAGATATA CCGGCAGATA GTGATTCCCA CCCCTTGGCC 1020 

TGAAGGGATT CTAGGACCGT CTCAAACGTA CCGGGAACCG TGGTGACGGT GAGGACACCA 1080 

CCGGCGTTCT GTATGTCCTC AGCACCCGCT TCGAGGGCAA GCTCCATGAG AGCCTCTTCG 1140

TCAACCTGTT CGGAATCGTA CTCTATAACT CCTTTGCGAT TGAACATATA GGAAACGGAT 1200 

CCTGCCGAAC CTAAATTACC CCCATTACGG GAAAACAAAT TGCGCACGTT CGCGGCCGCG 1260

CGGTTTTTGT TATCGGTGAG CACCTCGACC AGAACGGCAA CACCGCCCGG CGCATAACCT 1320 

TCATAAACGA GCTCCTCATA GCTACTGCCA GATAACTCCC CCGTACCCTT CTTAATAGCC 1380 
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CGCTCAATGT TATCTTTAGG CATATTAGCG GCACGTGCCT TAAGGATTGC AGTCCTCAGA 1440 

CGTGGATTAG CCTGTGGGTC ACCGCCTGCC ATGCGGGCAG CAACAGATAT TTCCTTGATA 1500 

AACTTAGTGA ACAACTGCCC ACGCTTTGcg TCCGCAGCTC CCTTAGCATG CTTGATAGTG 1560

GCCCATTTAC TATGTCCAGA CATGAGATCT TTCCCCTAAT GCCCGAAAAT GTACGTACCG 1620

GAACGCGGGC GCCTGATGCT AGCACGGTGT GCGCTTTTCT CCAAGTCCCG cTGGCGCATA 1680 

TTGCACGGCC CGCGTAATTA CCGCGGGCTT AAGAAGCACA GACCTAGCAC GTCGGCGCGT 1740 

GTGCTAAATC AAACAGATCT GCGTAGGCGC GCAACACGCC TTCCACCTCT CCTGCTAATA 1800 

TGTGTTGTGC AAGCGTCTTG CCCAGCTGCA CTCCTTCTTG ATCAAAGCTG TTCAAGTTCC 1860

ACGCAAATCC TTGGAACATA ATCTTGTTTT CAAAGTGTGC AAGAAGTGCG CCGAGCGTTT 1920

GTGGGGTAAG CGCTTTAGgT ATAGCAGACT GGATGGACGC TCCCCGGAAA ACGTTTTATT 1980 

TGCATCCGCG TGCTCTTTTC CCCTGGCGAA CGCGACAATT TGTGCGACGA CATTTGCAAG 2040 

GAGCTTCTGC TGACCGGTAG ATCCACGGAT TATCGGATCC TGCCCGAGCT GACTATGTTG 2100 

AAAGGCAATG AACTGAAGCG GCACCACCGA TGTTCCTTGA TGCAAATGTT GGTAGAACGA 2160 

GTGCTGACCG TTTGTCCCAG GCTCTCCAAA GATCACCGGG CCGGTCTTAT ACGTTATCGG 2220

AATGCCGAAG CGGTTAACAC TCTTGCCGTT AGATTCCATA TCTAGTTGTT GCAAATGTGC 2280

AGGAAAGCGA GCCAACGCCT GGCTATAGGG CAACACCGCG GTGTGCTCGT ATCCCAGAAT 2340 

AGTGCGCTCG TACACACCGA TGAGCGCGTC AAGAAGTGCT GCATTACGCC GTATGTCTTG 2400 

TTCCTGTGCT GCTCGGTCCG CCTCTGCCGC ACCGGAGAGG AAGTGCCCAA ACACcTGCGG 2460 

TCCAAACGCA AGCGTGAGTA CCACAGCGCC ACAGACAGAG GAACTAGAGT AGCGTCCACC 2520 

GATAAAATCA TCCATGTAGA AGGAAGCAAG GTACTGGGGA TTATTTGCAA GTGGACTGGT 2580 

CTCGCTGGTA ACTGCCACGA ACTGTGTGTG CGGTTCTAGA CCTGCTTGAC GAAGgACGTG 2640

TGCGACGAAA AGCTCATTAC TGAGTGTTTC AAGCGTCGTA CCACTCTTTG ATACCAAAAT 2700 

AAAAAGCGTG GTCTCAAGCG GTAGTTTTGA GAGTACAAGC GCTGCGTCGT CTGGGTCCAC 2760

GTTGGAGATA AAATGTGTGC GCATCTTAAC CGCCTGGTGC CTCTGTGCCC AACCTTCCAG 2820 

CGCGAGATAC AACGCCCGTG GACCGAGATC TGATCCACCA ATTCCAATTT GTACAACGTC 2880 

GGTAAACGGT GCGCCGCGAG ACGTGCGCAG CCCCCTTCGT GTACTTGCCs TGCGAACGCA 2940 

CATaCTCTTT CGTATTCTTT TGTATAAAAG GCGTGCATAT CGCGCACTTC GCACGGCAAC 3000 

GAGGCAAGCG ATGACCCCTG CACGCCGAGG CGCGTTAGGT GATGCAGCAC CTTACGTTTT 3060 

TCCCCCGTGT TTATCTGTGC TCCTGCGCGC AGGcGTCGTA CTTTGCGACT AATTCCTGCT 3120 
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CGTCTGCAAG AGCAGCAAGC GCCGTGAGAA TTTCTTCATT CACTGTTTTC GCTGCGTAGT 3180 

GATAGCGCAG CCCAGCCCCC GCGTCGGTAC AATAGCGCCG CACACGTTCT ATCCCCTCTG 3240 

GTCCACAGAG TACTGTCTTC AGCGACGGCG CACGAATCGC CTGCAGGcGG GCGTATGCGG 3300

CACACTCGTC AAGATTTCTC CAATTCACTG CGCGTTCTCC TTTTATCGTT CTACCCCGTA 3360 

GggTTTACCT ACAGACATAT CGCCGGCTGT TCTATGTATC AAGACGCGGC ACGACAATCG 3420 

TCGCGAGTGC CGGGTTCTTT TCTAAATCTC TTTTTAATCC TGCTGCCCGC GCCTTATTGA 3480 
CATAGGTTGG ATCCTGGAAC GAGTAGGTAA AACGCGTGTG CACGTCGTAT GTTCCCTGCA 3540

TGAGTGCATA CGTGTCCTCA GTCATGACCA ATTCTTCATC TGTTGGGATT ACCAGAATGC 3600

GGACGGGTGA ATCGTCTGTA CTAATTTCAG TTTCTGCATT GCGCGTGCGG GCCAGTTCAT 3660 

TTTTTCGCGC ATCAAGTCGG ATGCCTAGGT GTTCGAGTCC TGCGCACGCT GctGCGCGTA 3720 

CGTCGCAACA CATCTCTCCA ACACtGCGGT AAAGACAAGC GCGTCCGGCT GTTTACCCAA 3780

AGCTGCAACG TATGCGCCGA AGTATTTCCG GATGCGGTGT ACCTCCATGT CAAAGGCAAG 3840 

GCGTGCAAGC GCGTCTCCAT TTTTCATGGC AGCACACACA TCGCGTCGGT CCACGTATTT 3900 

TCCGGTGATG CCTAGCAAAC CGGACTGTTT ATTGAGAGTG GTGTCGATGT CTGAGACAGA 3960

CATGCCTGTT TTTCTCATAA TGTAAAAGGC AAGCGCAGGG TCGCAGTCCC CGCAGCGTGT 4020 

TCCCATAATC AGGCCTTCTA GCGGGGTGAT GCCCATGGAA GTGTCAAAGC TGACACCATT 4080

TTTGACACAA CACATGGAAG CGCCGTTTCC AATATGCGCA ATGATTATGT TTGTGTCCTC 4140 

AGCCCTTTTT TTGAGAATGA CAGAGGCGCG CTTTGCAGTA TAAAGAAAAC TCGTGCCGTG 4200

AAAGCCGTAG CGACGTACCG CGTATTCTTC GTACCACTGC CGGGGCACTG CGTACATGAA 4260

GCTAGCTTCT GGCATGGTTT GATGCCACGC AgTATCCATA ATGGCACAGT GGGGAACTGA 4320

GGGGATGACC GCCTGGGCAG CCTCAATACC ACGGATGTTT GCGGGGTTGT GGAGAGGGCC 4380

AAGGTCTTGA ACAGAGCGAA ATGTTTCTAG CACGTCAGGA GTCACAACGA CAGACTTTAC 4440 

AAAGCGATCT GCTGCGTGTA GGACGCGGTG TCCAACTGCC TTGATAAGAC TCATGTCGCT 4500 

GATAACACCG ACGTGCGCAT CGGTGAGGGT GCTGATGATA AGCTGCACCG CTTCGGTATG 4560 

GGTAGGGCAG GGACTTTCCC GAACGTGGTT CTCTCGGCCG TGCACCTCAT GCGTGATAAC 4620 

AGATCCTGCC TGAGTAACAC GCTCTACCAC GCCGACGGCA ATCACCGCAC GCTCTGTCCA 4680 

GTTATACACC TGGTATTTTA CAGATGAACT GCCGCAGTTT AGCGTGAGGA TAATCATAAT 4740 

ACACCTCCAC CGTTTTGGTA ATTTCTCGGA CACCGTAGCA TACACGCAAA ATGCGCCACT 4800 

TTCCTACACC GTTGGcTTAC ACTGcTTACG CGGATATAGC CCCCGCAGCA GCGtaTCCAG 4860
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CAGCCACTGC GCTTTGGCAG TGTCCACGCG TGCAAGGGcA CGGaCGCGTG GGGATCTGAC 4920 

GCGACTGACG CGAGCATGTC CTGCTTTTCA CGCCCGGTAA CTACGACGTA aTTTCGTGTG 4980 

CATTATTGAT AAGGTGCCCA GTGAAaCTCA CACGTTTCTG ACCGGTGTCT GGGTGAGTTG 5040 

CCACGACGCA ACAGCCGCTG TGGTCCCACA GCTCTATTTC ATGAGGGAAA ATAGACGCGG 5100 

TGTGTCCATC CGCCCCCATA CCCAGGAGTA TAATATCAAA GCACGGCACG CCACGCTGTC 5160 

TTGGGAGCCG TGCTTCAATT TCCTGTGAGT ATGCGGCGCA GGCGCTCTCC GGGGCGTCTT 5220 

CTCCCCTGAC GCGAAACACC GCGTCAGgAT TTATTTCCAG AGGCTCAAGG AGCGCACTAT 5280 

GGGTCATGTT GAAGTTACTC TGCGCATCCG TGGGGGGTAC GCAACGCTCA TCGCTCCAGA 5340 

AGAAGCGGAG GCGCTTCCAA TCAAGGTGGT GTCGAAACTC GTGCGCCCAA GTTCTGAAAA 5400 

TCTCCCTTGG AGTGGAACCC CCCGACAGGG CCAACCAGAG AATCTCTTGT GTTTTGAGCC 5460

GAGAATCAAA CACCGAAACG AGGAACGCCG CGATGGCACG CGCATCCTCA AAAATATGCT 5520

TCTTCATGGG CGAACATCCT CCTCTCTCCC GCTACGTTCT AGTGTCGTTC AAGGCTCGGC 5580 

ATTACCGCTA GAGTCGGCAG GCAAAATCAT CGCTGAGCAG CGTAGAAGAG GGGTGATGCC 5640 

ACCGTGGAGC ACTCCCTTTG ATCAGGTCGT CTGCaGCtTC GGACCCCAGC TTCCTGCAGG 5700 

GTACGTAAGT AGAGGACTCT TGTTTGATTT CCATGCGGCA AGAATAGGAT CTATGAAGCG 5760 

CCATGCAGAC TCCACCGCGT CATCTCGATG GTAGAGCGTG TTGTCTCCAT TCATGCAGTC 5820 

AAGCAATAGC CGCTCATACG CGCTGGGTAA GTGCGAATAG GTAAGAGCCG AATACTGAAA 5880 

ATCAACACTG ACGGGAATAG TCTTGAACCC CGCGCCGGGC TCTTTGAGGT CGATTTTAAG 5940 

CTGAATTCCT TCGTCGGGTT GAATGCGAAT GACAAGCGCG TTGCCCTCGC GTGCGCACGG 6000 

GCGTTCGATG TGCTCGAAAA GCGCGATGGG GAGCGTTCGG TAATGGACGA TCACCTCAGT 6060 

GACGCCCGTG GGCAAACGCT TACCCGTCCG CAGtAGAAGG GAACGTCCAT CCACCGCCAA 6120

TTGTCGATGT AGCACTTGAG TGCGGcAAAG GTTTCAGTGC ACGAGCGAGG GTCAACGCCT 6180 

GACTCCTCAA GGTAGCCGGG GACGGCTACA CCGCGTATCT TGCCGGCGAC GTATTGGGCA 6240 

CGCACCGTAT GCTGCATGAC GTCGCGTTCT CCCATAGGGC GCAGGCAGTC AAAGACCTTT 6300 

ACGATTTCAT CCCGTAGACG ACTTGAACTC ACGaCGGCGG GCGCCTCCAT CGCGATAATA 6360 

CCCAAGAGGA GTAACAAGTG GTTTTGGATC ATATCGCGCA ATGCACCGGA CTGGTCGTAG 6420

TAACCGCCGC GGTTTTCGAC ACCTAGTGAT TCGCTTGCAG TAATTTCAAC GTAATCGATA 6480

TGGGTCCGGT TCCATGTGGG CTCGAAAAGG GGATTGGCAA AGCGAGTGAC CAGGATGTTT 6540 

TGGACCGTTT CCTTACCCAG ATAGTGATCG ATGCGATAGG TTTGGTTTTC CTGAAAGTGG 6600 
GCACGCAAGC TCGCATTAAG GTGcTGCGCG GTTTCTAGGT TGTAGCCAAA GGGTTTTTCA 6660

ATAACTACCC TGCGAAAATT ACCCTGTTCC CGGTTCAAGT GGTGCATAGC AAGCTGCGTG 6720

GGGATAGTTT CGTACAGGCT AGGGGGAGTG GCAAGATAGA AGATAAAGTT GCCCTCGGTG 6780

TGCAGCGACT GGTCGAGGGT GCGCACGTAC GTGGCAAAGT CGGCAAAGGC GACAGAGTCG 6840

GTGGGATCGA ACGAGAAGTA GTGGATCTTC TGCAGGAATT CGGTGAGGCG CGCCGGGTCG 6900

TGCGGTGTGC GCACTGCATG CTTTGTGACC GCCTCTGCAA GCCGTGCGCG AAAAGACTCT 6960

GTAGACAGAG CCGTACGCCC TGCGCCGAGT ATACCGAATG TAcGGGGCAG GAGCTCTTGC 7020

TCAAAGAGAT CCCAAAGCGA GGGGATAAGC TTCCGCGCGG CAAGGTCGCC TGAAGCGCCA 7080

AAGATAACCA GGATGTGCGG CGCGAGCGTG CCGCTGCCAC TGATTTTCCC CATAAACCGC 7140

CCCTTCTTTC AACGGTGCGA CCTACACCGG ATGTGCCGCA GGsAaCTCTC CGCTCCCTAA 7200

GGCACTAAAT GCGGAACACC GGCCCTATTT TTACCATGAC CAGCGAGGTG CAGCAATACT 7260

TGGCCCATAT GTTCGACCAC GTCAGGTCCT GTCCATTCCC ATTTGCGCCC TTTTTCAAAT 7320

TCTTGGTGAG CGGCACGTTC ACTCCGCTGC CGAAGtGAAA TCCAACCCAA CGTGCTTGGT 7380

AAAGAAAATG AGGAAATCCA AATTTATCGG TATCCCCACC AGTTTGTCGT CCTGCGTAGA 7440

AAGGATATCC ACGCCCAGAG ACGGAATAAA TGCAAAATTC TTGCCGCTGC GTATAGCCCA 7500

GCCCACCAGG AACTGCGCAC GGAACATAAG AAAGGCAAGC CCCGCGTCTA ATTCTGTCGT 7560

GAACGCGAAg CCGTTGTGCG CTATTACCCC CACTGCCAAA CCGAGCGTCG GGGTGTACAA 7620

AAGAACGTCG GTCCTGGGGG CCGGTTCCTT TCCCCAGGGA TGCGCCCCTA CCTGTCCTAC 7680

tTCGGAGAAA CAAATACCTC CGCCGCAAAA ACGCCTGCCC CCATCCCGAG CGCAGCGAGC 7740

AAAGAACcTA CGCGCACCAc GCACCGCGCC CGTACCCCCC CCCTCGcCGT GTGCCACTGT 7800

ATACCCATAC GATCTAACCC CAGCTGTAGG ACACGCCTAC TGGGCCGATC TTCCCGCGTG 7860

GGGTGTGAGA GTGTCAAGCC CTCCCCCCTT CCTTGCGAAG AGGAGTATGC CAAACGGTGA 7920

GAAAAACTTG ACGGCGCGCG CTAAACGCCT AATAATTGCC TCGCAGCCTT TAGAAAAAGG 7980

AGGAGCTCGT GATTCGCGCC CTCTTTTCCC TCTTTCGGTC CCTCCATGCA AACACGCACC 8040

CGGCAGATCT CGCGCATGCG GCAGCGTTGG CACTGGCCCT CGCGTTGCTT CCTCGGAGTT 8100

CTCTCCTGTG GTACCTACTG TTTGCCGTCT GCTTTTTTAT ACGGCTGAAC CGTGGTCTGC 8160

TCTTGCTATC GCTCGTGCTG TTTGGTTTTG TCGTTCCTTC GTTCGATCCC TGGCTCGACA 8220

GCCTCGGCAA TTGGGCGCTG TGTTTACCAC GGCTGCAACC CGTCTACCGC GCCCTGATTG 8280

AGATTCCCTT CGTAGGGCTT GCGCGCTTTT ACAACACTAT GATTGCCGGC GGTCTGGTGG 8340
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CAGGTGCGCT GTGCTATTTG CCGTGCTATG 
GTACATACCT GTACCCTAAA ATTCACCATG 
CGTTGTGCAA AAGGTAAAGA AGATACTCAG 
TCTACACCCA AGACGCCTTC GCGCCGGATT
CGGTTCTTCT GCAAACGGTA CACTCCCCGT 
ATCCCTGCTG ACCGCGCGTA CTGCATGCGT
CGTGTGTTTG GCCGCACGCT CCTTTCTCGC 
GCGCACTCAG CGGACCTGAA GCGGCTCAAT 
GGGCGGGTTA ATTTTTGGTC CCTCTCCATG 
CTCGTGTACT TGATCCGAAA TGTCATTGCT 
GTCTTTGGTG CGCGGTGCGA AgCGGCAGTG 
CGCCTGAAGA ACTATGCGGT GGCAAACAAG 
GAAAGTATCG ATATCCACTT TGACCTCCTG
ACGATGGTTG TAGAGGGCGT GACGTGGAAC 
CCGCGCCGCG CAAAACGTCA ACGTGTGCGC 
GAAAAAGCGG CGGAGCTGGC CGCCCCCGTG 
GCGCAAGTGG ACCCGCGCAT TCTCCTTGAA 
CTCGTACAGC ACGTGGGTGC GCAGGCGCCC 
TTTGACGCAC ACGCCCGTGC GGAAAAAACG 
GACTTTCACG CTTTAAAAGA CGTGTCGGCA 
GCGCGCCGAT CCACTGAGGA AGCCCTCGCT 
CAGGATGTGC ATTCGACATT GGGTCTTGCG
GGTGCGCGCA TCGyCCGTGC cGCGGCGGCT 
AAATTTATCT CTGGTCTTTG CACCGTCTTT 
TATGTGGCGC AGATGCTTGA TTATGTCCGG 
CCGTCTGCGG AGGCAGAAAA GACAGCTCAG 
GTAATTTTTT GTTTGAGCGC AACGTCCCTT 
CCGCAGATCC GCAGgCAAGA TTTTCTGTTG 
CGCACGGGTT TGGCGAACCG ATTTCGTTCC 
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CTCTTGC AC r CTGCGCGGTG 
CGACGATTTT CTTTCTTGTC 
CGTCAGGGAG AGGTTTTCAT 
CGGCATACAG GAAGGAGGCG 
TCTCTCAAGC GGTTCCTGCG
TACCTTGCAG ACCCCGTATC 
ACGTATGTTC GCTTCGATCA 
GCCATTGCAG CGTCAATAGC 
GCTTGTGCGA GCGTCCTCGC 
CGGCGTGTCG TTATCGGTGG 
GTAGATCTTG ATCTATTCAA 
CATCATCCCA TGTGGAATCT 
GAGCTTTCGC GGGGTAAGTT 
ACGCCGCGCA AAACGTCTGG 
AGTAGTAACC CGCTTATTGC 
TCTTTTGGCG CAGGGTTTTC 
CGCGAGGTGA AGGCGTTAAA 
AAACTTGCAG AGCGCTGGAC 
GTGGCGGCGA TCCGTGCGGT 
ATAAAACAAG GTATCGAGAC 
ACTGCGCGCA CTATCTCCCA 
CGCGAGTTCG CCGCGGCGGT 
ATCCGTGATA TCCAGGCAGA 
TTGGCACGGA GCTTTAGCCA 
GGGTCGCAGC GAACACCGTC 
AGCCTTACGA CGCGCAAGCn 
CCGTGCTGCT GAGAAACATT 
CAGCCCGTGT GCGCAATGCG 
TCCTGGACGT GGCTGCAGGC 







ACGGCGTACC 8400 

CGGAACGCCC 8460

GAGCGATGAT 8520 

CACGCTGCAT 8580 

CCGAATCCAT 8640 

CACCCCTGTC 8700 

GCaGGCTATC 8760

AAAGCAAAGG 8820

GCTTCTCGGG 8880 

TTCTGAGGCC 8940 

CGCGCGCTTC 9000 

GTTTGAAATC 9060 

CGTCTCACAC 9120 

TGCTTTGCCC 9180 

AAAAATACAG 9240 

TGCGCTCAAA 9300 

AACTCCCACC 9360

GcAGCGTGTG 9420

GACTGAGCTT 9480 

GCTCGATAGA 9540 

CGAATTGCAG 9600 

aAAGGCAGAC 9660 

TGGAGGAAAG 9720 

TTATTACCCC 9780 

TGATGGATCG 9840 

CTTGCAGGGA 9900 

GGGGTGTCTG 9960 

TCAAACGACG 10020 

GCACAGGACG 10080 
CCtCkCTGCG CGcGTGGTGG ATCTGCGCCG GGCGCATCCG GACTTAGTAG ACGTCTCGTG 10140

CACTGCGCGG GGTATTCCGC TCGCTGTCCC GGCACCTGCA GAAGGATTCC CTGAGCTTTC 10200

TGGCGTGCTT GGAmTGCATA CGCAGgTGTT TGTGCGCAAA GATCACTCGG TGGAACTCAA 10260

GATGGGGGCA CGTATTTCAG ACAGCGTATT GCGCgCTGCG CCTTTTGAGC CGCGCGTGCT 10320

GTTTGACGTG TACGCGGATG TGTTGCGCCA GATACGGCAG ATTGCATTTG AAGCTACGGT 10380

GCGCGTCTCT GCAGAGGGTG CGTTGAGTAT TTCGGTAGAG AGTGACGCAG ATGGCGCGTT 10440

TGTGCGCGCT CTTTCCCGTG CGTTTGCGCA GCAGGTGGAC GCATTGCGCC GCGCGGTCAT 10500

TGCAGAAGGG GAGCGATTTC TTGCTCAGCA ACGCCGCGTG TACGCACAGG AAATTGCGCA 10560

GGTAACGCAG CTCGTTTCCC GTGCGGAGGA CGCAATTGCC CAGCTGGGGG TGTCTTCTCG 10620

CGTGATACAG CAGAAACGGG CTGAGGCGGA GCGCCTTCTG GAAGCTGCAG CGCGCAAGGC 10680

ACTGGGGGAG GTGACTAAGg TGCCGCAGAC GAGCTGCAGA ACAAGGCGCG AGATGCATTC 10740

CGCTCCTTTT TCTAGGGGAG TGGCGCCGCC CCTTTTCGGT GCGGCCTCAG GGTTCGGCTG 10800

AnGCCGGTGC GGGCGCTTTG 10820

(2) INFORMATION FOR SEQ ID NO: 29:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 13257 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29:

CAGGACGGTA nTCTCGTCTC TACGCTGACG AAGTTGCCAC TGATAGTGGA GATCGGTTTA 60

TCCAAATGGC GTTGGTAAAA CTCTTGCCCC AGAGGGCGnC AGGCGGACAG AGACTACAGG 120

AGATTGTGGC GCCGAGTCAG TCGGACATCG TGCTTATCAT GCTGCTAACC TGGCTTGAGC 180

GTGCACGGCT GGACCGGTTC AATGCTGATG CGCTGCTTAC GGCGCAGTGG ACCTATGTGT 240

CGGCTGGACT GTATGGGGCG ACGGCGGGTA CCAATGTATT TGGTAAGCGC GTGCTGCCTG 300

CGCTGCGGTC CTGGCATTTT GATTTTGCCG GATTCCTCAA ACTCGAAACC AAAAGCGGTG 360

ACCCCTACAC CCACCTGCTC ACCGGCCTGA ACGCCGGCGT CGAAGCACGC GTGTACATCC 420

CCCTCACCTA CATCCGTTAC AGAAATAACG GAGGGTACGA ACTGAATGGA GCTGTGCCCC 480

CTGGGACtAT CAATATGCCA ATTTTGGGGA AGGCGTGGTG CAGCTATCGC ATCCCCCTCG 540

GTTCCCACGC CTGGCTTACA CCGCATACAT CCGTGCTCGG CACAACCAAT CGCTTTAACG 600
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TTATTAACCC CGCGTACACC CTGTTGAATG AACGAGCGCT CCAGTACCAG GTGGGACTGA 6 60 

CGTTCAGTCC CTTCGAGAAG GTGGAGCTCA GCGCCCAGTG GGAACAGGGG GTGCTTGCTG 720 

ACGCTCCTTA CATGGGTATT GCCGAGAGTA TGTGGTCTGA GCGTTACTTT GGcACGTTTA 780 

TCTGTGGGGT GAAGGTGGTT TGGTGAGGGG TTGTCGTGTG GGCCAGAGAA CGGGTACGGT 840 

GGGGGTGCGC GTTTTCCCCG TGGGGgCTGT GCGCGCTCAG TTTACAGGCG AGGGATTGCA 900 

GGGGTATGTG CGGGAAGCGT CTGGGTAAAG TGATGGTGCT CGGGTGTATG TTGCCGGGTG 960 

TGGCGGCGCG TGTTTCTCTC TCCCCCAAGC TCGGGGTGTA CGGGGACGCA CGCGGCGGTT 1020 

CTGACCTGTG GGGCATCTGC ATACAAGCTC CCACAATGCC AGATACAGAG AACCAGGCGC 1080 

CTCCGCGCTA TGCgcCgGAG ACACCGTTGG TGGGGCTGGA CGTGGCGTTC CGTGCGGAAA 1140

ATGGCTTCCT GCTCCAACTG ACGGTGGACG CGGCACTCAC GCGTTTAATG TTCTGCGGCC 1200 

GGTGTTTGGC CGGTTATTCG TTCAGACCGG GGGAAGGTAG TACGCATCTG TCGGTAGCGG 1260

CGGGTTTTGA GTGCACCGCG CTCATCTACG ATAGCCAGCA CTTTCTTTCG GTTCTTGGGC 1320

AGGGCTTACT GCAGCCGAGC AGCTCGTCTT ATTCAGCCGG TAACTGGCAC CGCCCACGTT 1380

CATTGCTTGG CGTGCTAACG TGCACTGCCA AGGAGGTAGG CGCCATACAC GAAGAGTCGC 1440

GTATTAAAGG GGTCTGTCAG AACTATGCGG TGCCGGTGCA GCTGGGGGTG CAGCACTACT 1500 

TTGGCGCGCA TTGGGGGATA GACGCGACGG CTACCGTTTC GTTTGGCATT GACACCAAGC 1560 

TGGCTAAGTT CCGCATCCCG TATACGTTGC GCGTTGGCCC GGTCTTCCGC ACCTAGGGGA 1620 

GGCGCCGGGA GGAACGGGTC CTGTCGAAGA ATTGCGGGGA GGAGTGAAGG TATGTGGAGA 1680

AAATGTCTGG GTAAAGTGGT GCTACTCGGG TGTGCGTTGC CGTGCGTGGC CGCGCGTATT 1740 

TCTGTCTCTC CCAAGCTGGG GGCGTATGGG GACGCACGTG GCGGTCCTGA CCTGTGGGGC 1800 

TTGTGTATTA AGGCGACCGA TGCAGAGGAG GTAAGTGGGG ATCCCGATGA CACGGAGATG 1860 

GAGTATTTAC CTCCCCGTTA TGCGCCGGAG ACGCCGCTGG TGGGACTCGA TGTGGCGTTC 1920 

CGTGCGGAGA ATGGTTTTCT GCTCCAGCTG ACGGTGGACG CGGCGCTCAC CCGCCTGATG 1980 

TTCCGTGGTC AGTGTTTGGC CGGTTATTCG TTCAGGCCGG GGGGGGGtAA ATACGTATCT 2040 

GTCGGTAGCG GCGGGTTTTG AGTGCACTGC GCTCATCTAC GACAGCTACC ATTACATCAC 2100 

CATCCAGGCC CCCAATGAGG GTTCGGTGTG TTCGTTCGAA CATGGAGGGT GGTACGTTCC 2160 

AAAGACAGTG CTGAgCCTGC TGAGGCGCCG GAAGTGTCaG GATGCTAGGG CTGAGTCTGA 2220 

GGAATTGGGC ATCACGGGGA TTTGCCaGAA CTACGCGGTG CCGGTGCAGC TGGGGGTGCA 2280 

GCACTACTTT GGCGCGCATT GGGGGATAGA TGCGACGGCT ACCGTTTCGT TTGGCGTTGA 2340 
CACCAAGCTG GCTAAGTTCC GCATCCCGTA TACGTTGCGC GTTGGCCCGG TCTTCCGCAC 
sTGAGCGGGT GCGCGCTCAG CGTGCCCCGT TTAGAAGGAg GCCGAGCGCT CCTCTACCGA 
ACCGTCTGCG TGCGCAACAA AGACGCGTAC GGTTCGCTCG GTGAACAAGC CATTTtCAAT 
GACCCCCgGC AGTGCATTGA GCGCGCGTTC CATGTCTTGC GGGGTGCGCG TCGGGAGCGA 
TTgCCACCGC GcGTCTAAAA TAAAAtTTCC GTGGTCAGTC ACTACCGGTC CTTTTTTTCT 
TyACCtCGCG TATGTGCACG GACAACCCCC AATCCTGAAG CGTGCGCATC ACGCTCATGC 
GGGCCTCAGG CACCACTTCG ATAGGnGAnT GCGCGCGTAC CTAAGGTTTC TACCACCTTT 
GTTTCGTCTA CGATGATAAC AAAGTGCGCG CTGTTGTATG CAGCGATCTT TTCTTGCAAA 
AGCGCAGCTC CACCGCCTTT GATGACAAAA TTTTGGGTGT CAATTTCATC CGCGCCGTCG 
ATAGTCACAT CCAGTTTGCC CCCAATCCGT TTTGAACTGA GAGAAAAAAG GGGGATGTTG 
TACCGCTCAC ATATGAGCGC TGTTTGAAAA CTAGTGGGCA CTGCCGCTAT GTCAGAGAGA 
GTGCCGCGTG CAAGGTGATC TGCGATGCGT TTTACCGCAG GCATTGCCGT AGAGCCCGTC 
CCAAGGCCAA TACTCATGTG CGCGTGCAGC ACCCCCTCTT GAACGAGGGT GTCnCaCTGC 
GCTGGGCAAC CAGCAATTTC TGCGCGGTAA CGTCTAATGG GGTG TTCGTC GTCGTGTTCC 
TCTCGTGCAT AGCTTTTTCC ACAAGTGCAC TCACGCGTCT GTATCCTTTT TGTGGTGCAA 
AAGAATATCT GCATTGTGCC aGCTGAGGGT TGCGCACTTA ATGCGCGCAG GCATGGATGC 
AAAACAGGCG AGGATGCACG CGTCCTGTAG GTGTGC CCGC TCCTGGTCTG TGAGGCACTG 
CTGTGCCATC ATGTGAAAGA ACAGCGCAAC CGTTTTTTGC GCCTGCGCCA CTGACGCACC 
CTTGATCAGT TCGATGAGTA TATTTGTAGA AGCGGTGGAC ACCGCACAAC CGGTACCTAA 
AAAGGCTACA TCAGCGATGC GATCACCTTC TCTCTTTATC AAGAGCGTGA GGTCATCGCC 
ACAACTGGGA TTATGACCCC GCTCGATGCt ACCGGCCCTT CTAACACCtG CGGTGTTCCT 
GCTTGCGTGC GTACTCGAGC AGC AC tGTCG GTATATCGCT TCTGCGTTCA TAGGGAGATC 
TCCTTAGAGA AAGGCAGCGA ATGCCAGGAA GGCGCACTGG CCTAGCTGCA TTGAAAAATC 
CTGCCCACGG cgTGCAAgCA CssGTcAGcg CCTcTACATC CTCCATGGTA TTGTATATGC 
AGAAACTTGC ACGGCAACAG GACTGAATGC TCAAGTGCGT CATGAAAGGC TTACTACAGT 
GATCGCCGCT GCGAACCATC ACGCCTTCTT CGCCCAAGAT ATGCGCAGTA TCGTGCGAGT 
GCACGTTCTT CACGTTGAAT GCAATGATGC CTAGGCGCTC GCGTGCGCGC GCATGGTACG 
TTTCAAGGAA GGGAAGCTCC TCCAGCCGCG CAAGGAGTGC AGCATCCAGC GCATGTACGG 
ACGCGCGGAC TGCGCTGcTC TCTAGGGACT CGCAATACTC AATCGCTGCA CACAGTGACA 
CGACAGCTGC AGTATTCGCG CTACCTCCCT CGTACTTATG CGGCGnACCC TTAAAGACAC 4140 

TTTCCTGTTC AGTCACAAAA TCCACCATGC CTCCCCCATA CAAAAAAGGA GGCATGGATT 4200 

CCAGGAGCGT GTGCGGTGCG CACAATACGC CGACGCCAAA AAGAGAGAAC ATCTTATGGC 4260 

CGGAGAAAAC AAAGAAGTCG CAGCCTAAAT CTGCAACATT TGGCACGCCG TGCACCATAG 4320 

CCTGTGCTCC GTCAATGACC ACCACTGCAC CGACTTGGTG TGCAAGTGCG GTCAATTCCT 4380 

GTGCAGGATT TACCGCGCCG GTGGCATTGA CAACGGCAGA GAAGGACACA ATCTTAGTGC 4440 

ACGCTCGTAT CTTTTTCTGC GCTTCTTGTA TATCCAAATT TCCTTCGGCG TCTGGATACA 4500 

GCCACTGTAT CGTTGCACCT GTGCAGCGGC ACACGTGCTG CCACGGTACG ATATTTGCGT 4560 

GATGATTGGA GATAGCAAGA ACGATCTCGT CTCCTGCGCG CAGCGTGGCA GCGCGTAACA 4620 

GTGAGCGATG ATGTTGAGCG ATTCGGTGCA ACTCTTTGTA AAAACGATAT CGTGCGTTGG 4680 

CGCTGCGTTG ATAAACTGCG CTGTTTTCTT CCGGGTGTTT TCTATAAGGA GCGCTGATTC 4740 

AACTGCAAGT TCATGGGAGC CTCTGCCTGC GTTCCCATTC AGATGGGTGT GGTAGTGCAT 4800 

AACGCGCTCT AGCACCGGCG CAGGGCGTTG GGTTGTGGCC GCGCTGTCTA GGTAGTGGAC 4860 

GCGGGGACTG CGCAACAGCA GGGGAAAGTC TGCTTTATAA TTGGGGCCGC TCATGCCTTG 4920 

CGCTTCCTAT GTGCGCGATC GAGGCTCTCG TCAAAATTAC GTACGAGTGT CTCGCGGATG 4980 

TGAGCGTCAT CGATGAGGGC GAATACGGGT TTAAACGCAG CTTCTATGAT GAGGCGCTTG 5040 

GCACCGTACT CATCAAGACC GCGCGACATA AGGTAGTAGA GCACATCGCT GCCGATAGTT 5100 

TCAAAACTGG CTGCGTGTTC CCCGACAACG TCGTCTTCGT CACAAAAGAT AGTGGGGATG 5160 

CTAACCCCCA CGGCAGTTCT GTCAAGCAAA ATGGTACGGT CTGAGAACCG TGCTACAGAA 5220 

TGGCTACACC CGCGGTGCAA AAAAATATTT CCACGGAACG TTTTGCGTGC ACCGTCTTTT 5280 

ACCACCCCAC AGGCGCAGAT GTGCGCGTGT GAATTTTTTC CTTCCACGAT GAGGTTATGT 5340 

TCAAGATCCA TACGCCGCGC TTTATCAATG AAATACAGTG GGTGAATTTC CACACGTGCC 5400 

CACTCGTCCC GAAGGAAGGC GGAGTTGGAA ACACCTGAGA TCTGTGCACC TATTTGTACG 5460 

TCGTAGCAGC GCACCTGCGC GCTTTCCTGT GCGTGTAGGT GTACCGTTTC AAAGTTCACA 5520 

GCCGTAGGtG CGTGTTCTGT ACTTTGATTA ACTCTACCGA TGCGCCACGC CCCACCTGCA 5580 

CGCTTACCAA ACCATTCCTG AACGGAGCAC GCTCAAGCGC TTGCGGACCG ACGAgTGCGG 5640 

GAGCATCCTG TGGGGTATAC CCTGCGCGTC CACACAGACG AGGACTTTTA CGCGCGCCCC 5700 

TTCCTGTATA TCAAGAAAGG TCTGATCGTA TAGCACGCGG TTATGCGTGT CCATGGTAAA 5760 

ACGGATGAGT ACATGCACCG TCTCAGATGT GCGCGGCACA CTCAGATACA CCCCTGCATT 5820 
TCGTTTTTCC TTTACTTCTT GTACATACGC TTCCCCCAGA CCGCAGTGGC GCTGACGATC 
GCGTGTGCGA AAGTCAAACG CGCGCGcCGC CCTTTCTGGC TCGGTGCAAA GAAAATGCTC 
TATGGATGAT GAGcGCACAA GCGCTGAGCT AGAAGTAACC TGTGTTCTGG AAAAAGGCTG 
AGAAGCAACG TCCTGCGCGT GATACCCGAG CCGTTTAAAA AATTCCCTTT TTTTCATACT 
GCTGTGCGCC CTCAGGAAGG GAGTTAGCCG ATCGCTCCCT CAAGTTCAAT GGAAATAAGG 
TTATTCAGTT CAACGGCGTA CTCAAGAGGC AATTCCTTtG AGACGGGTTC TACGAACCCT 
CTGACGATAA GAGAGATGGC AGTCTGCTCA TCAAGCCCGC GCTGCATGAG ATAAAAAACG 
ACGCGGTCAC TGATTCTGCC GATTTTTGCC TCATGTCCGA TATCAACGTT ATCCGTACGT 
ACATCAATGA TGGGGATGGT ATCCGTATGC GACTGGTTAT CGAGCATGAG GGACTCGCAC 
TCAGCGACCG CTTTTGCCCC GTCAGCCTTT GGACCGATGG AAAGCAACCC GCGGTAGTTT 
GCCGTTCCGC CATTCTTTGA TATGGATCGA GCATGTACCT CCGATACCgT GTTCCTGCCC 
AGGTGCACTG TTTTTGTTCC AGTATCGAGG TACTGTCCTG CAGAAGCAAA AGTGATGCCG 
GTGnAnAnCT GCGCGAGCGA TCTCC TCTG A GGATACTCAT CGGATATAAC ATCGTGACGC 
GGGAACCAAA GGAGCCTGAG ATCCACTCGA TGACGCCGTC TTCGTCCACA ATGGCGCGCT 
TGGTATTGAG GTTGTACAGG TTTCGTGACC AGTTTTCTAT GGTGGAATAG CGTAGGCGCG 
CGTTCTTTTT TACGTACAGC TCCACGGCGC CTGCGTGCAA CgcATTTTTG TAGTACTTCG 
GCGCGCTACA CCCTTCGATG AAGTGGAGGG ATGCGCCTTC ATCCACAATG ATGAGCGTGT 
GCTCAAATTG CCCGGATTGA TTTGCATTCA AGCGGAAGTA GGACTGCAGG GGTAAGTCCA 
CCTGCACCCC TTTGGGCACA TACACGAACG ACCCGCCTGA CCACACCGCT CCGTGCAGTG 
CAGCAAACTT GTGCTCGTTC GGTTTAATCa GATGCATAAA GTGCGCGCGG ACAATGTCTT 
CGTGCTTGTG CACGGCAGAC TCCATGTCGA GGTACACCAC TCCCTGTTGT TCTAAGTCTG 
CCCGGAGGTT GTGGTACACc ACCTCTGAGT CGTACTGCGC TCCTACTCCT GCAAGG g ATC 
TTCGCTCCGC CTCAGGAATA CCGAGGCGAT CAAAAGTCTT CTTTATCTCC TCTGGGACGT 
CATCCCAACT TTCTGCGATT GGCTTAAAAT CGGAGaCAAT GTAGTGGACA ATCTCTTGGA 
TATCAAGGTC AGAGATATCC GCGCCCCACT CTGGCATGGG TCGCTTCATA AAATAGCGCA 
AGGATCTGAG ACGCAAGTCG AGCATCCACT GTGGCTCCCG CTTGCGACGC GAAATTTTCT 
CTACAACCTG AGCGTTCAAA CCCTTACCGG TTGAGTAGGT GTAGGTAACG GCGTCTTTTA 
CATCGTAAAT ACCTCGCTTG ATGTCCGATA CGTACGTTCG CCTGCGCGGC TGTAAAAGCT 
GTCTCTGTTG CTGTGTATTC ATACGCGCTT CCTCTAAGCG GTGGATATGC GGTCTGCTGG 
CGTGGGGGCA GCGCGCCTCC ACAAAAGGAT AACTTACTTT TTCTATTCCG GAGAAACGGG 
CGTGCCTCCT TGTTCGCCTG CACGGGGCGT GGCAAAGTCA GCGTAACCGT GTTCAACCAC 
GTAGTGCACC AAACTCACGT CACCGGTCTT CACGATGGTA CCGTCGACGA GGATGTGCAC 
CACGTCAGGC TTAATGTACT CGAGAACTTC TCGGTGATGG GTGATGATCA GGAATCCCAT 
ATCGGGCGTA CGGATATCGT CAATGCCCTC GAAGACAATG CGCGTAGctC nAACATCAAG 
TCCTGAATCC GTCTCGTCAA GTATGGcCAG TTTGGGCTCA AGAACAGCGA GCTGAAGTAT 
TTCGTTCTTT TTTTTCTCTC CCCCAGAGAA TCCTACATTC AGGCCGCGCG AGGCGTACGC 
CTCACTGATG CGCAAgcGAG CAAGCTTCGC ACGCAACTGC gTGTGAAAGT CGAGCACGGA 
AACTTTAGTA CCAAGAACCG CCTCTTTTGC CGCGCGGAGA AACTCCTCGA CCGAAAGACC 
GGGGACTTCC TCAGGAGTTT GGAACGAGAG AAAAATACCC CGCCGAGCGC GCTCGTACAC 
AGGCACGTCG TTGATACACT GCCCTTGAAA ATAAATTTCC CCACGTTCGA TAG TGC AGTG 
GGGATTTCCC ACGATGGTGC CTGCAAGAGT GGACTTGCCT GCACCGTTCG GTCCCATGAC 
GGCGTGCACC TCGCCGGTAT TCAGGGTTAG GTTGAGACTT TTGAGGATGG GCCTATCCGC 
AATGGACATA CACAGGTCGC GGATATCGAG GAGTGTGGGC ATGAGCGGCT CCTGCAAGGA 
GTAAACTGAG CAGGAGTATA CGTACATTCG AATGTATGTT GCAAGGGAAA GAGACCACGC 
ATCCTGCACA GGAAACACAT ACAGGTTTAA TACCGTGCGC AGTGTATGTC CTACCTTGGC 
GTTCTACCAG TTAATAGTCA TTCCGCACAC GAAGgTGCCA AAATACCTAC GGGAAGTAAC 
GGTTTCCGTA ATAACCATGT AGGGCGTTGG TTCCAACTGT CCCTGCTCCC ACTGTGCGTG 
AAGCGTCACT TTTTCAATGG GAGAGAGCGT CAGGCCCACC TGGTACTGCA CGCAACGCTC 
ATG AAC TAG A TTTTCTGTCT TAAGATTATA GTTGAAGCGA TTCGTGGTGC CGTATACGGC 
AAGCGAAGGT TTAAGCCATG CAGTTTCGCC AAGCGGAATA AGGTAGCGCG CCCACACCTT 
CCCCATAACG GGCAAGTTGA TATGGGTGTC AGGGAGTTTC CACACACCCA TTGGAGAGAC 
GTAtATCCTT TTCCATTGTC TATGTACAGG CCGTGGG TAA GCGGGATATA GCACCGCACG 
TCCATCCCTG CTTCCAGTCC GTCTATAAgG TGTGTATAcG CGTCTCCCgC TTTGGTTTCT 
ACCCGCAGAA AGCCgCCgCC GTCCGTGTGT GTGCTCCCAT ACGTAGGAAA GACCATGGCC 
CCAAACACGC TTGCAGGAGC AGTGGCCACG TACACGCCGC AGGCAAACCA GCGCCATTGC 
AGAGTCAGGA GTGCATCGAG CGCGTAGGTG TCCAG CGCTT GTTGCCAGAG CACTGTCAAC 
AGCGCTGCAA GCAAGGGGTT tfGAACTAACT GTCGGGTTCG TCTGTTCTGC TTGTACGATT 
TTGGTTGCAA TATCCCGCAC AGAGGATCCG TTAGCGGACA TTCCGTTCTT GACACTATCG 
AGCTTCTGCG CGTACTCAGG CAGCTGCAAC ACGCGCTGTG CAATGcATAC GCCACCGGkC 
yTCCGGGGGT GACCTTGGTT TTTTCCATAT CGATAAGGAT CGGGGCAAGA TAGTCGAGCG 
TTTTCCGACC TGCATAATGG TTACCCATGT CCAGAGCCAA CACCACCTTG AAGTCCGAAA 
GAGGAGTGAG CGTTATGCGC GCGCCTGCAT TCCACAGGAC GTCATTCTGC CTATAGTGCC 
TTTTCTGTTT ATGGCTGACC CTTTTATAGC CTTTTCCCAG TGTCGCGCTG GCTGCCACTT 
CTGCCCGTAC TAACTCGCGT TTCCACGGCG CATAGCTGAG CGTCGCGTCT GCCCCGAAAC 
CAT AC TTGCT ATGCAGAGCC TCTTCCGCCG CACCTCCAGG GGGAGCAGGA GCAGGGGCAG 
GCGCGTCCCA GGAACCATTT GAGGCAAAGG AAAGAAAGCA AATATCCAAG CTCACTCCAC 
TTGAGCCTAC GTTCTGTGCA CGGTAGCCGA GCCTGCCGCC GATGCCATCG AACCC TGGCG 
CAAACCGCAC CTCCTCTTCC TTGTAGAGGT CTGCAAGAAA AGGTTTCCAC AGTTGGGCAA 
AATTGGCGCG AAAGAGGGGA GCGGTGCCGA TCGTCATATA CGCACCAAAG CAGTGTAGCG 
TCGCTTCAAT AGCGGTTTCT TCTGTGACTA GGGTAAAAGG CTCACCAGGC TTTTTGGTCT 
GAAAATTTAC CTCCAAATCC TTGATAGAAA TTTCAGTCCA CAAGCCACCG TCAGATAATG 
TACCTGCGCG CCTTATGCGG TCACTTTTTA AAAACAACGG AACAGtTAaC ACAAGTGTGT 
GGTACTGCGG AACCCGTGCG TTATCTGACG AATGCGCGCT TTTTCCTTCT CACTCTCTTC 
TACTTCCTGT CCCTGCGCTC CATTTCCCTG TATCTGTACG TTCACCACAT CTCTGTCGTC 
GCCATCTCCA TTCTCGCGGG GCGGCACGGG AGGCGGCGGA CCTACTGCGG GGTCGTAGGG 
GAGCGTGATA CCCCACTGCA ACC r GGC AAA GCCTGAAAtC CGCGGCGTAC TTGCTAAGGT 
GTGCAGTCTG GTTTCTGCAC ACAGTCCTGC TGCGGACGAC ATAAAGAGGA GCGCCCACGT 
GCCAACTGAC CTGACACGCC TCCCTGCTGA GTGTCCAGGG GGATCATGGC AGGAGGAGCG 
CACAGCCGCC GACGGAGACA GCAGCGTGCG TCGCCGCCGG ATTTTGCTTT GAGAGTACAA 
CACCyTGCAG GCATAAAGAC AGGGACAGCG TACTCCTTTC ATGGCTCCAT CCTAAAAGTC 
CGCAGTGCGC GGCGTACGAG GAAACGGAAT AACATCCCGA ATATTCCCAA GCCCGGTGAC 
GTACTGCAGC AAGCGCTCGA AGCCGAGTCC AAAACCTGCA TGGGGCGCGG TACCAAAGCG 
ACGGAGATCG GTGTACCAGC GATAGTCGTG AGGGTCAAAA CCGCTGGCAC GGATGCGAGC 
ACAGAGTACT TCAAACTGTT CCTCGCGCTC CGAGCCTCCC ATAATCTCCC CTAATCCCGG 
AACTAGCAGG TCCATGGAAC GCACCGTTGT GCCGTcGGCA TTGAGCTTCA TGTAGAAGGC 
CTTGATTTCC TTTGGGTAGT CATAGACAAT CACCGGGCCG TGGAACACCT CTTCTGTTAA 
AAAACACTCG TGCTCGCTTT GTAAATCGCA TCCCCAGCGT ACGGGGAACT CAAAGGAGCG 
CCCACTGTTC TCCAGTAGTT TAATTGCCTC TGTGTATGTC AGGCGCGTGG CAGGCGCGCG 11100 

GGCGACGTCT TCGAGCATGC GCGTCAgCTG CCCTGGCGTC CGCACTGGCG GTGTGCGCGC 11160 

TGTGcTGCGC GCGGcAAGGG GTGTGTCGCC CCsCGCGCTt CcGCATGgCT GyGCgCGCtc 11220 

GTCAAGGAAG GCTATATCcT GCGCGCAGTC CTTGAGTGct GCGCGTAGCA GGTACGCCAA 11280 

AAACTCCTCT GCCACGTCCA TGCAGTCAGT GATGCGTGCA AAGGCGATTT CCGGCTCCAC 11340 

CATCCAGAAC TCAGAAAGAT GGCGGCTAGT ATTTGAGTTC TCTGCGCGAA AAGTAGGGCC 11400 

GAAGgTGTAG ATGCGCGTGA GGGCAAGCGC ATATGCTTCC CCCTGCAGTT GGCCCGAAAC 11460 

GGTTAGGCGC GCTGCCTTAC CAAAAAAGTC GTCCGCGTAC GTGAGTGCGT AGGGGTTGCC 11520 

TGCCGCGCCC GCTGCGTGTG cTTCGCGCGC AATACGCACG GGATCAAAAG TAGTGACGCG 11580 

AAAGAGCTCG CCTGCACCCT CGCAGTCCGA AGCGGTAATG ATCGGTGTGT GCACGTACTG 11640 

AAAgTGTCGC TCGGAGAAAA AGCGGTGGAC AGCGCCTGCA AgTGCACTGC GCACCCGTGC 11700 

ACACGCGGCA AAGGTACTAG TGCGCGCGCG CAGATGGGCG TGCGCACGCA AAAACTCAAA 11760 

ACTATGCGAT TTCTTCTGCA AAGGATAGGT TTCAGCAGGC GCCTCGCCAA GAACAGTCAG 11820 

GTTGCAAGCG CGCAACTCAA GCGCTTGCCC GGCGCCTGGG GAGGGGACGA GTGCACCCTC 11880 

GGCGCGAATG CAGGCGCCGG TAGTAACGCG TTTGAGCGTT TGAGCGAGCG TTTCCCCCTG 11940 

GAGGACAGCG TCGCGGACAT TAGGGATTTG CTCAGTTGCG CCCCAGAGGA AAGGGAGGCG 12000 

GAACACTCGG GcAGGGGAAC GGTAACCTGA AGGGTATCAG GGCAAGAACC GTCGCTCAGA 12060 

CTGATAAAGA CAGCGCGTTT TGTCTCCCGT TTGGAGCGCA CCCAACCGTG AACGCATTCG 12120 

TGCTGGCCTG AGGGGGGATG AGTCAGAATC TCCTTGAGCA AAGGGTGCAT AGCACGCACT 12180 

CTAACGCTTT TACCCTCTTT GTGGAAGGGC GTGGACCGGC AAGCAGCTGT GACGCCACGG 12240 

CGCACGCCCT GCGCGGCATC TGCATGGAAG GCCGCGCAGG mCTGGGGAGA GCGAACCGTG 12300 

CGAAAAAGCG TCGTTTCATT TCCAGGGAAC TTACTCTCTA GATGAGGAGC GGCCGAGGCG 12360 

CGGTCTTCTG TGACCGGGAC CACGCCGTAC GACATAGAAA ACCAGATGCA AGTAGAGTAT 12420 

CAGAAACACT CCCGCAGAAA GGACGCGCGG GTGAACGATT ACCGCGCCTA TAAGACTCCA 12480 

CAGGTGCACC CTTTCCATGG CGGATCCCTC GGCATGTGTG TTTCGTTCCT TAAGGATACC 12540 

TGGGCACAAA CCCTTGACGT GGTGCGCAAA ATGCTCGACC ATGTGCCCGC GCGTGCCGTG 12600 

CGGCAGTGTC TCGGTCCCGT GTGCATTTTG TACCATAAGT TTCAGGAGGA AATGTCCTAT 12660 

GCAGCAGCGC TTCTTCTTAC TCGGTGTCTG CGCTTTTGCT TTTGGCGTCC CGGTTTTTCC 12720 

CCAGCAGGGC ACAGATCCAA GTGTGGGTGC TCAGGCCAGT GCGGGCGACG GAGGCATGAT 12780 
GACCGTCGAG CAAGCCTATC TGAACTCTGC AGAGGGTGTG GTGATCAAAG AGATGGTTGA 
gAGCaGGGGG CATGATTCAA AGGTGCTCGC GCTCCAGTAT ATCCAGGAGG CACTTGAAgG 
CGGACGTGGT TCTGATGACC TCCAGGAGGC GCTAAGTCGG TTGGCCACTG CTGGATTGTT 
CCGCGTGATC CGTGAGCAAG GGCGTGTGAT TAATGATTTC CCCGACATCC GCCTGCGTGC 
TTGCGAGCTA CTCGCCCGGT TTCtTCGGCT CGTACCAAGG ACGCTCTCAT CCAAGTCATG 
TGTGCTGACC GTGAGCTTCG GTGGTGAGGG CGGCGGTTAA GTCGTTAGGA GAGGTGGGTA 
TCAACGAGCA GGACGAGACA ACCGCCACTA TTGGCTGGAT TAGTCGGAAG TTTTCCGCTA 
TTAACCCGAc AGGTTCTCTC GCGCTTGAGA TTTTGAACAC GTACGAGCGC CTTGCTC 
(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14512 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 
AGTTTCCCGA GTGGTCAAAG GGAGCAGACT GTAAATCTGT TGGCGTTGTC TTCCAAGGTT 
CGAATCCTTG ACTCCCCACT TTCGTCTTCC GTTTGCTTTT GGGTAGTGTC TGACTTGTCT 
TTCCCTGGCG TTTCGTTCCA GGCGTTTTTG CTAGCTGCTG TGCCTCTTGT CACTTTCTTG 
AGTGCAGGAT GTTCTTTTCG TGCGCGTGCG CGCGGTTGCG GAAGGATTTC AGTGGCGGAG 
GAGGGGACGT GCGTGTGCAC TTCTGGGGGG TGCGGGGGTC TGTGCCTACT CCTGTGACAC 
CTCGACAGGT CCaGTCAAAG ATAGCgGCTG TCGTTCaGCG CATAAGTGCa AAGGATGTCA 
GGAATCAGAG ATCCAAGGAG CGTTTTATTT CTGATCTGCC TGCCTGGCTC TTTGGGACTA 
CGGGTGGGAA TACTACGTGC GTGGAGATGG AGACTGATTG CGGGGAAACC CTCATCTTTG 
ACGCAGGGAC AGGCATTCGT GATCTGGGTA TCGATCTTAT GAGCCGTCCA GGCTACAGGG 
CGCAGGGGCA TGTATACCAC CTCCTGTTTA CGCATTTTCA TTGGGATCAC ATCCAGGGGC 
TACCCTTTTT CAATCCTGCC TTTGATCCTC GTAATACCAT TATCGTCTAT AGCACTCGCA 
AGAAAATGAA GGAATTCCTT GAAGATCAGA TGAGGTATCC TTACTTTCCA ATATCTATGT 
TTGGACGCGA CGGTTTTAAC GCAAAGTTTG AATTTCGCCT GATAGGTAAC CATGAGGAGT 
GCTTTGCTAT TGGGAAGACG AAGATAACTT GGAACCGGGT GCGTCATCCA GGCGGATGTG 
TATCGTATGC GGTGAGCGAG GCTGGTGGGA AGAAGGTGAT TTTTTCTACC GACACCGAGT 
TACGGCAGAA GGATTTTGAT AGAAGTGAGC GTAATGTCTG CTTTTACGAT GCCGCAAGTC 
TGCTCATAAT TGATTCGCAG TACACCATGA CTGAATCCAT CAAAAAAGAA GGGTGGGGCC 
ACTCCACGTT CTC TATAGTG GTTGATTTTG CAGTAAGTTG GGGGGTGAGA AGACTGGCGC 
TGTTCCACCA TGAACCTACG TATGATGATA AAAAGTTGTT TAGCATTTTG CAGAATGCCT 
GCTGGTATCG CAAGTACGTT GGTGCGCACG ATCTTGAAAT ACTGCTCGCA CAGGAAGGAA 
AGGATATCTT TGTATGAGTG AGGAGGAGCG CATGTATAGC TTTAGCGGtG AAGAAATCAA 
GGAACTCGCG CTTCTGTTTC GTCGGTGTGG GCAGACATTG GCGCCGGCGC TGCGCCGTCT 
TGCGCTGTTT GTCGATCGCA CTGTTTGTCG C C ATATGACG GTTGAGGAGG CTGAGGATTT 
TTTCGGTAGT GCAGAGCGCT AGGCAGTCGT GAGTATCCGG ACCTTTTCTT TTACTCCCAG 
CGTTCGGCTG AGGCGCATTG TTCTCTGGGG CAGTTTGTTC TGTGCGGGTG TTCTTTGTCT 
GCTGTGTCTG TGTCTTTtAG TTGGCCTTGC CCCGGTGCGT CCTTTTGTGA AAAAGGAGCA 
TATGTTCACT GTGCAGTCCG GTGTGGGCGC GCGGAAGGTC ATTCACGAAC TGAGGAACGC 
ACGGCTCATT CGATCCGAGT GGGCTGCGCG gTTGTACGTG TTCGCGCGCG CGCTTAATTT 
TAAGGCGGGT A c T AC GC AG T TTCTCCTGCA ATGAGTGCGG TGCGCATTTT AACTATGCTc 
GACGATGTCG AACAACAACG CTTTATCAAG GTCACCGTCC CCGAGGGACT GACGGTAAAG 
AAAATTGCTG CACTGTTGCA AGACGCTACA GTGGTAAGTG CAGCGGCGTT TGTGGAAGCT 
TGCACGAGCG CTGCATTGCG AACGCGCTAT AAGATCCCTG CTCCTTCAGT GGAGGGTTTT 
CTCTATCCTG ATACG TATTT TTTTAGTTAC CAGGAACGCG CGGCCAATGT GGTGGGAACC 
ATGATCGAAA ACTTTCTGGC CAAGACTAGC CAGTTGCCGT CGTTTCCTGG TGATCCGGTT 
GCGCGATTTA AAAC CGTCAT ACTCGCTTCA ATCGTGGAAC GCGAGTAcCG CGTGGCTTCT 
GAGGCAGCAC GCATCGCAGG TGTTTTTTAT AACCGGATGA AGGTAAACAT GGGACTGCAA 
TCTTGCGCGA CAGTCGAATA TGTCATTACT GAAATTGAGG GGAAAGCGCA CCCCGAGCGC 
TTGTTCTTTA AAGACCTTGA AATAGACAGT CCATTTAATA CGTACAAATG TGCTGGGCTG 
CCCCCAGCTC CTATCTCAAA TCCTGGGCTC ACCGCGTTGA ATGCTGCGCT GCATCCTGAA 
GTGCATGACT TTTTCTATTT TAGGCTCACC GATCCGCAGc GGGcACGCAC ACGTTCACCA 
AGACGTTGGA CGAGCATGAT CAAGCTGGGC TCATGCTGCT AAAGAAAAAT ACGGGAATGT 
AGGCAGTGGC TCGGATTTCT GCGCACGTTA TTGATGCGAT TGCTGATCGT GTGGATTTGG 
TTTCGCTGGT GGGAAATTAC ACGCATCTGG AGCGGCGTGG GGATGACTGG TGGGGTCGCT 
GTCCATTTCA TCATGAGCGT ACGCCTTCGT TTCATGTGGT GCCGGATAAA AAGATGTACT 
ATTGCTTTGG GTGTGGGGTT GGTGGATCCA CTATTAAGTT TTTTATGGAA ATCGAGAAAA 
TTGATTTCCA CGAAGCGGCA GTGCGTCTTG CAAAGCGTGC AGGAATC GAG ATGTCCTTTG 
AGGACGGGGT GCACGCTCCT TCTGcTCATG CTTCCTTTAC AATGcAGCTG TGTGAAGTGT 
ATCAGCGCAT TGCAGAGACG TTCCATCACG TACTTATGCA CACCGCGCAA GGamGCGTGC 
GCGCGCGTAC CTAGCCTCGC GCAaGGTAAC GGATGATTCA tACGC AC t TT AAGCTCGGGT 
aCGTCCGCCG GATCCGGTAT GGTTGTTTCA ATTTTTAAGG CACAAGGGAT ACTCCCCCGA 
GTTTCTGGCC CGTTCTGGGT TGTTTGCAAA AAAAAGCGAG CGTATCGCCG TTTTTTCAGA 
TCGGATCATG TATCCGATTG CCGACCGCTA CGGTC AGGTT ATCGCATTCG GAGCGCGCGC 
CTTGGGGACT GCACCTGCAA AGTATTTGAA CACGGCAGAT ATGCCACAGT ATAAAAAGGG 
TGAGCACTTG TTTgCtTTCA CTGTGCTCTT TCTCAGATGA GAAAGACGCG CGCGGCGATT 
ATATGTGAAG GATACATGGA TGTTATCGCG TTTCATCaGG CGCAGTTGAC GTATGCTGTT 
GcGCcTTTAG GCG CATTGCT GACGAAAAGC CAGGCACGTT TGATGCGTTC GTTTGTCGAT 
CGAATATATA TGTGT^PTOGA TGCCGACGGA GCAGGCAGAG CGGCAACGTA CAAGGCGATT 
TTGTTGTGTC GTTCCTTGGG TTTTGAGGTA CGGAT AG TAG AATTGAATGG AGGTACTGAT 
CCTGCAGAAT GTGCGTGTAT AGAAGGAGAG GACGCTTTGA GAAAAAGCGT AGAACGGAGC 
ACTAcTGACG CGCAgTATTT GATACGGTGT GCACGCCATG AGCACAGTCA CCTTGGTGCA 
GATGACACAT CACGTGCGGT GTCCTTTTTA TTCCCTTATC TGAGTGTCTT GGACTCTGCC 
ATTCAGCGTG AGCAAGTCAT GCAGGATATT GCGATGGCGT TTGGCATTCG CATACAGGCG 
GTGCACGCAG ATTACCTGCG TTATGTGTCC CGTACCACGC AGAAAGGGAC AACAGGGAAT 
TGTGTTCTGT CTGTACAGGG AACAGCGATA CAGGTGAAGG AGCCTGCTAC GGGAGTACGC 
ACTGCGCAgC TGCGTTTGGT ACTAGCGGTG GTAGCAAATC CTGAGTTATT TGAGCTCcTG 
CGGGAGAGTG TGTGTGCAGA TGACTTTGAA GATCC TATGG CAAAAGAGTT ATTCATAATC 
CTAGAGGAGT GTTATCGTGC AGACACGCGT GCAAGTCCGC ATGTTCTTTC GTGTTGTACA 
ACCGACGAGT TAAGGAAACT CGTGAGCGAG GCAATTGTCT GTGGTGAGTT CTCTTGCAAT 
GCGCCGCAGA TTGTGCGTGA CGGTGTTGCG CTCGTGCGTC GTAATAGACT GCTGAAGGAG 
CGAGAATCGC TCGTAGGgCG GCTGCGCCGA TTTGGGGATG CATCTTCGGG TGAGGAGTGC 
GGGTCTATGC AGGAGCTTAT GATGGAAAAG CAGCGGGTTG ATGAGGAGTT AGAAAGGTTG 
AAAGGGGTGA GGAAATGATG GAGCTGTCAC GTACTCCTGC GGTGATGCGC CTGTTAGAAT 
ATGCGAGGGA GAAGAAGGCT ATAACGCATG ATGAGGTCGA GAACATACTC GCGCACTATG 
GCGTTGAGAC AGAAGAGCTG CTACATGATG TGCTTGATAT GCTTGAGCAG GAGAATATAA 
AGGTCTTCTC CTCTGAAGAG GAGGAGCTAG AAGACGAAGc TTTGCAGGGC TGAAAGGACC 
TGCCGCGGAC GATGGCGATG GGTCGTTCCC CCTTTCAACT GAGCGCGTGC GTGATAAGCT 
GTGCGACAgT AGCCGTGGGG CACGGCAGAA CTTGCTGTCa AACGCGCGGA ATATTGCACT 
TGACGATCCG GTgAAaCTCT ATCTGCGTGA TATCGGCCAA GAAAAGTTGC TCACTGCGGA 
CAAGAGGTCA TGCTTTCAAA GCGGATGGAA GAGGGCGAAG cATCATAAAG GACATTATTA 
CCCAGTCTGG GCTCCTTCTT CCTGAGTTTT ATCACATTGG GCGCAGTCTT TCTAAAAAAG 
CTCTTGCGGT TTTGGATCCT GCAGAAAGCG GACGTACGAG AAAGGAAATC AGCGAGGAGA 
TGGCCGATCG CCGGCGTCTG AAACaGGCAT ACGGAGAGGT GCTtCGCTCC TTGTATCCTG 
AAATGCGTCA TTACATGGCA ATGAAAAAGC GGCTGGATGA GCGTGGGGAG CCGGTGACGg 
TTTTGAGTAG TGATGAAGAA gTGTGTAAGC AGCGCGACAA GTTGCTTTCC TGTTTACAAA 
AGGTGGACTT GCAATTAGAG GAGATAGATC GCTTTTCTCG AAAATTTTTG GACACCGCGC 
GAAAAATACG GGAATACAAG CGGCGTAAAG ATCGCCACGA AAAGCAACTT ATGATTGCTG 
AC C TGTGTG A CATGCGCAAG ATTGGGCGTG GTCTGGCCGT GCCCCGTCAG CGTGCAAAGT 
TGGAAGAGAC GCTTGGTATG TCTGCAGATT GTATTCAAGA GATCTATACA CAGATTCAAA 
AAGTGACACG CAGGCTGCGA CGCATCGAGT ATGACTTTGA AAATACCATC GACGGTATTT 
TATCCATGGC GCGGGCAATT C AC CGGGGTC ATGTCATGCT CAAGAAGGCA AAGGATAAGC 
TCATTAATGC TAATCTGCGT TTAGTTGTGT CGATTGCAAA GAAGTACACA AACCGTGGAT 
TGCTT TTTTT TGATCTCGTG CAAGAGGGCA ATATTGGGCT GATTAAGGCG GTAGAAAAGT 
TTGAATATCG CAAGGGATAT AAATTTTCCA CGTATGCGAC GTGGTGGATT CGCCAGGCAA 
TTACCCGTTC TATTTCCGAT CAGGCGCGCA CCATTCGGGT TCCGGTACAC ATGATAGAGC 
AGATAAATAA AGTGACGCGT GAGTCTCGGC AGTTGTTGCA AAAGTTTGGG CGTGAgCtTc 
TGATGAAGAA ATTGCGCAnA GCTCTGTTGG ACAGTTGAAA AAGTTAAGCA GGTAAAAAGT 
GTTGCGCGCG AGCCTATCTC TCTTGAAACT CCAATTGGAG AGGAGGAGGA CTCTTCCTTG 
GGTGACTTTG TCCCTGACGC TGACGTGGAA AATCCCTCTC GAGTTACAGA AAGAGTCTTG 
CTTAAAGAGG AAGTGCGATC TATCCTCTCC GCTCTTCCTG CGAGGGAGCA CGAAgTTTTG 
AGAATGCGTT TTGGTCTCGA TGGAGACTAC TCTCAAACGT TGGAAGAGGT CGGTTTGTAC 
TTTGATGTGA CGCGTGAGCG TATTCGGCAG ATAGAGGCGA AGGCCCTTAA GCGTTTGCGT 
CATCCACGAC ACAGCAGAAG ATTGAAGGAT TTCCTTGACA GTTAGGGGTA TGTTATGGTT 
CCTGCAAATG TTTTCGAGAA CTTACGGGCA CTGCAGGTGG TGCTTGCGCA GAAGAATCGC 
TTGGAAACCG AGATTGCAGA GGCGCCGAAG TTCTTAGTCG CTCAGGAAGA GTTGCTAACG 
CGTTGTAAAG AAAGTTTTAT TGAAAAGAAT GTCGAATACG AATCTGTGCG CGAAGAAGTT 
GCCCGTCTGA CCACCGAGTT GTGCAAGGCA GAGAAGCGGC GTGAGGATGC GGAAGTTGCG 
ATGGACAACA TTAGCACGCA GCGGGAGTAC GATGCGCTCG ATAGGGAGAT TCAGGAGGCG 
AAGCGGCAGG AGGTTGCATT GCGCTC CGAG GTAGCGCGCT CGGATGTAGC TTATAAGCGT 
TTGGCAGAAG AAATTAAGCT TGATCAAGAA GACATTGTGC AGCAGGAGAG GGAGCTTACG 
GAGAACAAGG CTCGCGTCGA CGCAGAGGTG CGTGGTAAAA GGGAGCAGGT GTTGCGTTTA 
CAGGAGGAAG AGCGGCGTCT TTCTCCAGAT CTTGACCGGG ATGTACTCTT TAAGTTTGAG 
CGTATTATCA AAAGTAAGCA GGGCGTGGGT ATCGTACCCG TGCGGGGGAA CGTGTGTGCA 
GGGTGCCACA TGATTTTGCC CGCGCAGTTT TCAACCGGCG TACGTGAAGG GAACAGTATC 
GTGTACTGCC CCTATTGCAG TCGGATTCTT TACTATGAGG AGACAGATGA GCCTGAGATG 
ACCTTCTTTG ATGAAGAGGA CCTGGGCAGT CTGTCGGACC TTGTCTATCC AGAAGAATCT 
GGAGGATTTG GGGGAGGTGA CCGGGAAGAG ATATAGAGAG GTTGGTAAAT GGGGTGACAG 
AGAAC TGC AG ATAGTCGCTG CGGGTTTCTC GCAGAGGAAA GTCCGGACTC cTTCGGAAAT 
GATGCTAGTT AATTACTAGG CAGCGGCTCT CTGCAGTGCC GCTGACAGCA AGCGCCACAG 
AAAATATACC GCCTTTGGGT AAGGGTGAAA GGGCGAGG T A AGAGCTCACC GCGTTTTGGC 
GACAAAACGG CACGGCAAGC CTCATCAGGA GCAAGATCGA GCAGCAAAGG ATATTCCGAT 
CCTGTTTTGC GGGTTGATTG CATAAATTTA TATAGCGATA TATAAAGTGA GACAGATGAT 
TATCCTTGAC AGAATCCGGC TTACCAGTTC TCTGTTTTTT TAGAGTATCG ATGGAATTTC 
TACTAAGGCG GACGGgCACC AGAGTTTCTT CCCGGGGCGG AGGAAACTGC CTAATTCCGT 
GTTCCTCTTT TCGGTCTTTT TCGCCCTGGT AGTGGGCGTG GGGGTTGGTG CGTGGCGTTA 
CCGTCGGTAC TACCGTGGGT TGCCGAGCGC GCGCAGTGTw ATGAGGACTG GAAGAATGGT 
AATTACAAAG CGGTGTACGA TAAGGCGGCT GAAATTCTCC AGAGGCGGGT GTTCGACGCT 
GAGATGCTCG CGCTGCATGG GTTTGCTGCC TACTATATCT TTTCAGAGCA GACTGACCTT 
TCTGTCAGTT ACGACTACCT CAATAGTGCT ATTGTGTCCT TGCGCCGCGC GTTGCATGTG 
GTGCGCCCTG CAGAAGTTCC CAACGTTTCT TATGTCCTTG GCAAAGCCTA CTACCAGCGT 
GGGTATTACT ACGCTGACTT GGCGGTGAAG TACCTGGATC TTGCCTATAA CGCAGGGTTC 
AGGGCTGCGG ATTTGGCGGA GTTTCGTGGC ATGTCTGCCT CTTTGCTCGG AGATATGCAA 
AAGGCGGTTG AGTCGTTCAC GCAGGCTCTC GCTGCACAGC CCTCTGATCT TGTGCTCTAC 7920 

GCGCTGGCAG AGTGTTATGA AAAACTTTCT GATTTTTCGA AGGCGAAgCT GtATCTGTAT 7980 

GATACCATCG GGAAAACAAA GGATGTTTTG CTTGAGCTAA AGTGcAGGAA TAGGCTTGcT 8040 

GCGCTGTATT TGTCTGAGCG CAACCTTGCA GAGGCTGAGC GAGAGCTGGA TGTGGTTTTG 8100 

CAAAAGGATG AgCGCTCTGC GGAGGCCCAC TATCATCGCG GGGTTCTGTA TGAGATGGGT 8160 

TCGGATTTGG TAAGGGCGCG GGCGGAGTGG CGGCGTGCCC TGAGGCTGAA TCCACTGCAC 8220 

GAGCCAACAC GCGTGAAGCT GAACCTGAAA TAGCTTGGAG GTGCCATGTT TTTTCTCAGA 8280 

CGATTTTCTG CTGACGTGGG TATCGATCTA GGCACGTGTA ACACCATTAT CTATGTGGAA 8340 

GGAAGAGGGA TTGTCGTCAA TGAGCCGTCT GTGGTGGCAG TTGAGCGGGG AACGAAGTCA 8400 

GTAGTTGCGG TAGGCTCGGA CGCGAagCGC AtGTTGTGGA AAACTCCGGG AAATATCGTT 8460 

GCGATACGGC CGTTGAAAGA CGGTGTGATC GCGGACATGG ATaCTACCGA GAAGATGAtT 8520 

CGTTACTTTA TTTCTAAAAT TTTGCCGCGC CACAGGCTCA TTAAACCGCG GATGGTCATC 8580 

GGGATTCCCA GTTGTATCAC GGATGTGGAG TGCAGAGCAG TGCACGAGAG TGCTAGTAAG 8640 

GCCGGGGCTG GGGAGGTGGA GGTACTTGAG GAGTCACTTG CTGCAGCCAT TGGCGCTAAT 8700 

ATTCCCATAG AAGAACCGGC AGGGAACATG GTGTGTGATA TCGGGGGGGG TACCACGGAG 8760 

GTGTCGGTTA TCTCGCTCTT GGGTATGGTG GTCACGAATG CAATTCGTGT TGGGGGCGAT 8820 

GAGTTTGATC AGGCCATTAT CAAGCACGTG CGATCCGTTC ACAATTTGAT TATTGGGGAG 8880 

CAGACTGCAG AGCGTTTGAA AATTGAAATA GGGAATGCTT CTCCGGAAAA GAATATTGAA 8940 

AAGGTGGAGG TCAAGGGAAC CGACGCCATC ACCGGTCTTC CTCGCAGGCT TGAGATAGAT 9000 

TCTGTTGAAG TACGTGAGGC GCTCAAAGAG CCTATCACGC AGATAGTGGA AGAAATTAAG 9060 

CGGACGCTTG CTCGAACGCC TCCTGAGTTG GCTGCGGATA TCGTCGAACG GGGCATCGTC 9120 

ATGACAGGCG GAGGCTCTCT CCTCAAAGGT CTCCCTAAAC TTATTTCTAA GGAAACGCAT 9180 

GTGCCGGTTA TCCTTGCAGA GAATCCCATG AACTGTGTTG CTATCGGCkC AGGAAGGTAC 9240 

CACGAAGTCT ACAAGGATAT TTCAGGGGAT CGTAGTCTGT ATGCGGGACT GAATTCATGA 9300 

tTAGGTGGAA AAGGCTTTTT TTTTTaGAAT AGACTCTGAT CTATTCACCT TTATCGTGTT 9360 

TTTGCTTGTT TCCTCAGgTC TCTTGGTCtT CTCAGGAGGG GAGCTGATTG TAAGCTTTAG 9420 

GGATGTGGGG TTCTCCGTTA CCTCCCGCGT GGAGAAGGCT GCAGCTTCGG TTTCTTTTTT 9480 

TGTTACTCAT ACGGTCAAGA CGTTGAAAAC CCTCTCAGAG GTGCAAAGGC GGTACGAGGT 9540 

CTTGCGCGAA CAACTGAAAG ACTACGAATT CTTGCAAGGA TCACGCGAAA GTTTGAGAAA 9600 
GGAAAATCaA AGGcTACGCG CCATGCTTGG GTTTTCCCGC GAGCTTTCAA CGCGCAACAT 
TCCTGCAGAG ATTATAGGTT TTGACCCCGA CAATTTGTAC TCCGGTATTG TTGTTAGCAG 
GGGTGCGCGG CACGGGGTGC GCAAGAATAT GCCTGTTGTT GCATTTCAAA GTGACACATT 
GGGGTTGGTT GGAAAAGTGG TGCAGGTTTC GCGTACCACG AGTATGATAG TGCCGCTTTA 
TCACTACCAA TTCTATGTTG CCGGAAAACT TGAGCGTGCT CAGTATCGGG GATTGATTAG 
TGGACAGGGG GGTAGTGACT TTCCCCTTCT AATGCGTTAT GTGAAGAAGC ACGGACAGGG 
AAGTATTCGT GTCGGCGACC TCGTGGTAAC TTCGGGGGAA AATTATCCTT TCCCGAAAGA 
TGTACCCGTC GGGAAGGTGC GGGACATTAA ACTCCACGAC CATGAAACTT CTCTTGAACT 
TTCTCTTGAC CCCGTTTTAG ACCTTTTCCG TTTGGAATAC GTTTTTATCC TCGACCTGTC 
CTTGTCCCAA GAAGGACCGC ACGGATGATA CGGCTCATCG CCTGGTCTGT AGGTACCTCT 
TTTCTTTTTA GCATTGTAGA GATGGCAGTG TTCGTACACG TTTCGTACTT ATCCATTATG 
CCAGATCTCG TCTTGCTCGT AGTACTGTTC ACGAGCATTC ACAATGGCGT GGTGGCAGGG 
ATATGGACTG G ATTTA t TGC AGGAATTATT TTTGACTTCC TTTC TATCTC TCCCTTTGGT 
TTGCATTCGT TCGTTTTCAC CACTATAGGC TTTATGGTAG GAAAGGTGCA GGGaAGATAT 
CATATCGaTA GAGTATTCGC CCCCGCGGTA CTGGCAGGCT TTGCAATGAT TTTCAAGGTG 
GGATTGGTGT TGGTATTGCG AGGAGTGTTT GGTCCAAATA TCCAAGTGTA TAGCGTGTTT 
TCACGcAGCT TTGGATAGAA ATGACG TTGA ATATTGTGTT TGTCCCCTTT GTATTCGGGC 
TTTTGAATAT GTTTCCGACC ACTTTTCTTT ATAAGAGGTT TTCTTCGTAG ATGCGTTATT 
TTTCTCTCCT TCCTGATCGT CATATGCTTT TTAGGATAAA GGTTCTCACC TGGCTCGTCG 
TGcTGGTTAT GCTGTTGTAC ATGCGGCAGC TGTTTGTCAT TCAAATCGTG CGGGGGGATT 
CGTTCAAAAA AAAATCGCTG AACATATCTC AGCGTAgTAA AGTAATTCCT GCACAACGGG 
GGGAGATTTT TGATCGCCAC GCGGaTCTGC CCATGGTGCT GAATGTCAAT TCGTTTGCAG 
TTGATATGAT CCCCGGAGAG GTTCCGCCTG AGCAGTTCGA TACGGTGCTC AACAAATTGT 
CGCATATTCT GCGCGTACCT ATTTCGGATA TTCGAAAGAA AATTCCTGAT GCGGTCCGCC 
GTTCATTTCA AACGGTGGAG TTGCGCAGTA ACGTGAGTTA CGAGGACATC ACTGcTATCG 
CCCAAATAAT TGATGAACTG CCGGGCGTTT CTTGGTATTC AAAACCAGTA CGAAATTACG 
TTGAAACAGG ATCATTCGCT CACGTTATCG GATATGTGGG GGAGATTACA AAAGAAGAGC 
TCAAACGATT TTACAGTAAA GGGTACAGGC CCAACAGTCT CATTGGAAAG GCTGGAATTG 
AAAAAGAATA CGACGAGGTC CTGAGAGGGA AAGAGGGACA CGAGTACCGG ACCGTCGATG 
CCCGTGGGCG ATACATAGAA AACACTTCGG TTACTAACCC TCCTCGCATG GGTAATAACC 
TCGTGCTCAC CATCGATCGG CGTATACAAA AACTTGCAGA AGACGCGCTC GGTCCTCGTA 
TCGGAGCGGC AGTGGTACTG AAACCGACAA CGGGAGAAGT ACTTGCTATG GTATCTTATC 
CGTACTTTGA C C AAAAC ATT TTCACTCAGC ATAACGCCCA CGAACTGTAT GCGCAGyTTT 
CACATGATAC ACGGTTCCCT CTGCTTAACC GTGTTGTGAA TGCAAGTTAC CCGCCTGCGT 
CGACGTTCAA GATkGTCaTG TCAACCGCTA TTTTGGCAGA GAAGGCATTC CCCCATGAAA 
AGACGGTGGA CTGTCCAGGA GAGATCGAGT ATGGCAATCG CTTATTTCGC TGTCATATCA 
GAAAGCCTGG GCACGGCAAG GTAGATCTCC GTCGTGCGCT TGAGCAGTCG TGTGATATTT 
ATTACTGGAC AGTCTGTCGA GACTATCTTG GCATCGACCG CATGATTTCG TACATCAACG 
ATTTTGGATT TGGCAAATCG GCGCGCATCG ATTTACCCAG TCAAACAGAG GgTATGGTTC 
CAACACCGAA ATGGAAAGAA CGTCGGTTTC ATGAAAAATG GTTGGATGGA GACACTATGA 
ATCTCGCTAT CGGGCAGGGT TACATGCTTG TCTCGCCTCT GcAGGTGGCA AACATGGTCG 
CGATGACCGT TAACAATGGC GTCATTTATC GGCCCCATTT ACTCAAGGAA ATTCGGGACT 
CTCGTACTAA CGAATGCTAT TTAGGCATAA ACCTGAGGTA TTAAAGACAG CAAAAATTCC 
TGCAGAGATA TTCGAGCACG TGCGCGCAGA TATGCATTCG GTTGTCACGC GTGGCTC CTC 
CCAGTATGCA ATGAAAAATA AGACCGTGTC CCTGGCAGGG AAAAC TGG T A CTGCAGAAGT 
AGGTTTTCAC AATCGGTGGC ATTCGTGGAT GGCAGCGTAT GGGCCTTATC ATCGCCCCCC 
GGATGAAGCG GTGGTCGTTG TGGTACTGGT AGAGGCAAGA AACGAATGGG AATGGTGGGC 
GCCGTTTGCA ACCAATATCA TTTTtCAGGG TATTTTTGCG AATGAGGATT ATGAGCAAGC 
AGTTGAGTCG CTCAAGTCGT ACGGCATTTC CCTTGGGGTG CCGGCAAGGA GTCGGCAGGA 
ATGAGGATTC GCGGTGTCAG TGATTT t G AC TACCTATTGC TTCTGACCAT GCtGGCGTTG 
ACCArCATTG GTATCTTGTT CATC TAT TC T TCCGGGGTAA ATTCAGAGGG ACACGTTATT 
TCCAGAGAAT ACCTAAAACA AATAGTGTGG GCCGTCATGG GTGTGGTGCT CATGCTTTCT 
GTGAGCATGT ACGACTACCA CAGGTTCAAG GATAGAACAA CGCTTATTTT TGCAGGTTTT 
ATATTGCTGC TGATATACAC GCGGTTGTTT GGGCGGTATG TAAATGGTGC AAAAAGCTGG 
ATCGGTGTGG GAGAATTCGG CATTCAGATT TCTGAGTTTG CAAAGATCGC GTACATATTA 
TACTTAGCGC ACTATCTTGT TTATTCTCAG AGTGAGCCTA TGCTTAAGCG CTTTGCGAAA 
GCGGGGGTGA TTACCTTGCT GCCCATGGCG CTCATATTGT CTCAGCCGGA TCTCGGCACT 
GCATCCGTGT ACCTGCCGAT TTTTCTCGTT ATGTGTTTTA TTGCAGGATT TCCTCTCCGT 
TTGATTTTCG CGGTGGTTTG TGTGGTCCTC CTGACTTTGC TCTTTACACT GTTGCCCCTT 
TGGGAGCAAA CCTTTTTGCA ATACCAGGGG GTGGCTACGC GCATTGCAGA TTCGCGTATG 
CTGTCGCTGT TTGTGTTTTT TTCTCTCAGC GCTACGTCTG CGGTAgcGGT GGTAGGGTAC 
CTGCTCTCTG GAAGAAAATA CTACTACTGG ATTACTTACG CTTTGGGAAT GGTGAGTATT 
TCTTATGGCG CATCGCTGCT GGGAGTTCGG GTTTTAAAAC CGTATCAGAT GATGCGC CTG 
ATCATTTTTC TCAATCCCGA GGTAGATCCA CTCAAAGCGG GATGGCACAT TATCCAGTCA 
ATGATCGCTA TTGGCAGTGG CGGTGCGTTT GGAATGGGGT ACTTGAGAGG ACCGCAGAGC 
CATTATCGAT TTTTACCGCA GCAGAGTACT GATTTTATCT TCAGCATTCT TTCTGAAGAG 
TGGGGTTTTG TTGGCGGGGT GATAGTGTTT GGTTTGTATC TGTTGTTCTT TCTGCATACG 
CTTTCCATCA TGAGTCACGT TGATGATTTG TACGGTAAGC TCATCGCAAG CGGTGTGTTG 
GGTATGTTCC TTTTTCACTT TGTAGTTAAC GTGGGCATGA CCATGGGAAT CATGCCCATT 
ACGGGTATTC CTCTGTTGCT CCTTTCGTAT GGTGGATCGT CTCTGTGGAC CGCGATGATT 
GCAACGGGAC TCTTGATGAG TATCAATGCA AGGCAGTTGT AAATAGAGTA AGGAAAGGAC 
ATTTGGTATG AAGGTGGTTC TCTTTTATGA TCAAGGAAGA GCGCATTCAG TTGCTGCGAT 
ATGCGAGGTG CTTTGTGCAC AAGGATGCGC GGTAACACCG CATGCGATTG AGCAGGTGTG 
GAACGACACA TCACCGTGCA GTaCgcCTTT GGCnTTGGTA CAGGATGCAA CGCATGTGTT 
TTTTTTGTaC gcGCATGAGC CCATGCGCGA TcCGGCTTTT ATTTTCTTTT CTGGAGTTGC 
TTGTGGGCGT GGTATGCACG TGCTG CTCTT GGCTACAACA ACGGAGGTCA GGGATATCCA 
TGTATTTCGC GACTTGGTCT TTTTACTTGA GGAGGAGACG TTTGAGGATT TCTTTCGTGT 
CGAGCACGAG AGATTTGTAA GGCAGAAAAA GAAGCGTGTC GC AC GC AC TG CGCTGTTAGA 
GCGCGGTTAT CCATGTTTTG AAGAAAATTT CATCGCGACA GTCATGGATG GGAATATTGA 
TATTGTCAAT CTCTTTTTGG ATGCAGGATT TAGCGCTGCG TTGAAAGACG CACGCGGTAC 
gCCTGTGTTG TCTTTGGCAG TGCGGGAGGG TCAGGATGAG ATGGCAGCGC AACTTnATTG 
nCGGCGGTGC GCCAGTAGAT CCAGTTAATG GGATCCTCTA AGTAGTTAAT TA 
(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3569 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 

CCGCCGCCCG CGTATTTCTC GCATTTCTCG TGGGTTATGC CCGAGAGAGT AGTACAGGAA 60 

CATATATCGT GGCG TTATAA GGCTCTCAAG CAGGATCATA TGGAAGTAGA CTGTGATGTC 120 

TATGCTTACC GTGGAGGGCG GGTGTTTCCC CGTGTGTCCG GTATGGGGGT TATCGATCAT 180 

ACAAACATCC CACATGCTCT GCGTATTTTT TGTGAAAAGA TGACTGATTC TTTTATGAAA 240 

AAAAAAATAG ATCCACACCT GTGCCAGAGA GAGAGAAAGT TTTTACCCCA TTACTTTTCA 300 

TTTCGAATGG GTAAGCTGCC gCGTATTTGC GCAGTTGTGT TTGCTCAACC GAATGTTATG 360 

CAGGGTAACT TTCTTACCGT TCATTTTAAA TTAAATGTAG AAAACGAGGA TTCTCGTATC 420 

ATAGAAGTCA CGG TGGCGAA AGAGCAGGAG AATTGGAAAC TATTCCAATT TTTATTTAAA 480 

GAGGATCGCG CGCATCTTGC TGTCTTGTAA GTGATGTCTG AGTCTGTAAA GGAGATGGCC 540 

GGAGGATGAA AGGTCAAGAT GTCATCCTGT GCGACGGGGG ACGTCATTTT TCATATAAGG 600 

TACTTC CTCG TGTGGTCATT GTGGGAAGAC CGAATGTAGG TAAGTCGACA TTATTCAACC 660 

GCCTGCTCGG TAGACGGCGC TCTATCACCA GCAATACGTC AGGGGTTACA AGAGATTCGA 720 

TTGAAGAAAC CGTGATTCTG CGAGGGTTTC CTCTTAGACT TGTTGACACG AGCGGTTTTA 780 

CCGTTTTTTC TGAAAAAAAG GCATCGAGAC AACATATCGA TACTCTCGTG TTAGAACAAA 840 

CGTATAAATC AATACAGTGT GCGGACAAAA TCCTTCTTGT GCTTGATGGA ACGTGTGAAA 900 

GTGCAGAAGA CGAGGAGGTT ATCCAGTATC TGAGGCCCTA CTGGGGCAAA CTCATCGCTG 960 

CGGTTAATAA GACGGAGGGA GGAGAGGAGG TGCATTATAA TTATGCACGG TACGgTTTTT 1020 

CTACCC TTAT CTGTGTCAGC GCCGAGCACG GTAGGAACAT AGACGCGTTG GAAAGGGCGA 1080 

TTATCCAAAA TCTGTTTTCT GTCGATGAGC GCCGGGAACT GC CGAAAG AT GATGTTGTTC 1140 

GTCTTGCAAT AGTGGGTAAG CCGAACACAG GAAAATCCAC TTTGATGAAT TATCTCATGC 1200 

GCCtACCGTT TCTCTGGTGT GTGATAGAGC AGGTACTACC AGAGACGTGG TAACCGGTCA 1260 

TGTTGAGTTC AAACAGTACA AATTCATTAT CGCAGATACG GCGGGTATCA GAAAAAGACA 1320 

GAAGGTATAT GAGAGTATAG AGTACTACTC GGTAATACGA GCAATTAGCA TCCTGAATGC 1380 

CGTTGACATT GTATTGTACA TCGTCGATGC CCGAGATGGA TTTTCTGAAC AAGACAAGAA 1440 

GATTGTTTCG CAAATCTCAA AGAGAAATTT AGGTGTGATC TTCCTTTTGA ACAAGTGGGA 1500 

TTTGTTGGAA GGAAGTACCT CTCTAATAGC TAAGAAAAAG CGTGATGTAC GGACTGCTTT 1560 

TGGGAAAATG AATTTTGTTC CCGTGGTACC TGTATCAGCT AAAACGGGGC ACGGTATTTC 1620 

TGATGCATTA CATTGTGTAT GTAAGATCTT TGCACAACTA AATACAAAAG TGGAGACTTC 1680 

CGCTCTCAAT ACTGGCATTG AAAGATTGGG TAACGTCGTA TCCTCCTCCA AGAAAGTATG 1740

GACACGTTTC GTTAAAGTAC CTGGtGCAGG TATCGGTTAG ACCTATTGAA TTTTTGCTTT 1800

TTGCAAATAG GCCAGATCGT ATACCGGAAA ACTACGTTCG ATTTTTACAG AATCGTATTC 1860

GTGAAGACCT AGGATTAGAC TCTATCCCTG TGAAGCTAAC CATACGGAAA AACTGTCGGA 1920

AGCGATAGAT GCAAGATGAA GGAGTGGATA TGAAAAAACT TCTTTTACGT TCTTCTGATG 1980

AAGTTCGAGT AATCGCGCCC TCGTGcTCAA TGCGTAAGAT TGATTCATCG GTAATTGAGC 2040

GTGCACAGGA GCGCTTTCGA TGTTTGGGTC TCAATGTTGC TTTgGAGATC ACGTGTACGA 2100

CGAGGaTTTT TTAGtTCTGC ATCTGTTGAT AAAAGAGTTG CGGATCTCCA TGCTGCCTTT 2160

GCAGATAAAA AAGTAAAGTT AATcTCACTG CAATTGGAGG ATTTAATTCT AATCAACTAT 2220

TGCAGCACAT AGACTATGCT CTTTTGAAAA AGAATCCtAA GTTGTTGTGT GGTTTTTCTG 2280

ATGTCACTGC GCTATTAAAT GCAATTCATG CGAAGACAGG AATGCCAGTT TTTTATGGTC 2340

CACATTTTTC GACATTCGGT ATGGAAAAAG GTATTGAGTT TACTATTGAA TGCTTTAAGA 2400

ACACTTTTTT TTATGGTCGG TGCGATATCT TAGCATCCGA AACATGGAGT GATGATATGT 2460

GGTTTAAGGA TCAGGAACAT CGCCAGTTTA TTACTAATCC TGGGTATGAA ATTATCCATA 2520

GAGGAGATAT GGTCGGGATG GGGGTCGGAG GAAATATTAG TACATTTAAT CTTTTAGCAG 2580

GTACGGAATA TGAACCGTCT CTGAAAAAGA GTATTTTGTT TATAGAGGAT ACGTCTCGTA 2640

TGTCAATTAC AGATTTTGAT CGCCACTTAG AAGCACTTAC ACAACGGGAT GATTTTTGTA 2700

CGGTGCGTGG CATTCTCATT GGCAGATTTC AAAAGGATTC AGGTATTGAT ATGGACATGT 2760

TGCGAAAAAT CATTTCGAGA AAAAAGGCTC TTGATGCTAT TCCTCTATTT GCAAATGTAG 2820

ATTTCGGGCA TACGACCCCC CATTGCATAT TACCTATTGG GGGAATGATT CGAGTTAATG 2880

TTGATAGAAA ATGTATTACT GTTCAGTTGC ATTCCTCAGT TGAGCAACTC CCAGAGTAAT 2940

TTCGGTGAAT GATGTTCTTG CGTTACCATT ACGTATGCTC GCACACTGCC TGAAATGCTC 3000

ATTGGAGAAA TAAAAGAGCC AGTTTCTGTA CTGAAGGGAA CAGGGAAAGT TGTTCTTGCG 3060

CAGTTGGAAA GGCTAAACAT TAGCACTATT GGAGATATCC TTTCGTACTG GCCTCGTTtg 3120

TGGGwwgrkA GAACGCAAGA ACAGATGTTT TCCCAATGGA cgCTGGCGCA TAGATTGCAA 3180

GTACGAGTTA GTGTCACTGC ACATTGCTGG TTTGGATTTG GCAAGAGCAA GACTCTCAAG 3240

CTTGTGGTAC AGGATGGCCA AGGATGCGTC GCTGAATTGT TATGTTTTCG CCGTAATTTT 3300

TTGCATTTTA TGTTTCCTGT TGGAAGTGAA GCAGTCGTGT ATGGAAGTTT TTATGAAAAG 3360

GATGGGTTGC TGGAAAGTAG TTCATTTGAT ATCGAAAAAA TCGATTGTAT TGAAAAAAAG 3420

ATTTTGCCTG TCTATCCCTT AACCAAAGGG TTAAAACAAA TGAAATTAAG AATGCTCATT 3480

TGTGCAGCAA TGGATCAATG GATTGGCACG GTTGATTCTG AATTGCCCAA ACCTATTCTT 3540

GAGAAATATC ATCTACTCAC AAAACGAGA 3569 
(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3858 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 

TGCTGAATTC TTCCGCGCGT AATCCTGTCG CCCATGCTGC CTCTCGCGTT ATTGAGGCTC 60 

CGGTAAGTGA GGGAGCGAAG AGTTTTGCTG GTGAGCGTGT CCTTGGTGTG CGCGTGTTGT 120 

TCCCCACGTG GGACAGTAAC GCAAACGCAA TGATAAAGCC GGCGTTCGTA ATTCCTGCGT 180 

ACGAGGTGAT GGCTCAGGTG GACGATCAGG GTAATGTACA GGCCCCCACA GAGGAGGAGA 240 

AGGCTTCTGG AAAGGGGCGT TTTGAAGATG GGTACGGAGT GGTAAAGAAT GTGGGTGTTC 300 

TTAAGTCCAT CGCGGTGAAC ACTTACGGGA TGAATTATCC TCATGGTTTG TACGTGATGA 360

TGCGGGATCA GGATGGTGAG GTGCATCGCT ACTTCATGGG GTATCTCCTG TTCGACTCCT 420 

GGAAGatTGG TGTGGAACAA TCCTTCGTAT ATCTCTGATG TTCGGTCGCG GGAGGTGCGC 480 

TTGTATCCCG TGTATCCCGC GTCGACGCCC CACGTCGTGT TTGAAGGCTT TATGGTTACT 540 

AGGGACGCGG CTCATGCCGG AGGGGaCTAT GTTGGTTATT TCAAGGACGT CAAGATTATC 600 

TATGATAAGG CGGTGCTGAG TACGGTGCGC GATTTTGCGG ACGAGGACCT GTGGGGTATC 660

CAGGCGCGGC GTGAGGCTGA GCGTAAGAGA GTTGAGGTTG CGCGTTTCGG GCAGCAGCAG 720 

GTGCTGCGTT ATATAGAGCA AGAGAAGCTT GCTACAGAGG TTGGTTTTAC ACCCTCTGGG 780

GGTGCTCAGC GGCAGGAAGA GCAGCAGTAG TGCAGTAGTC TTCCTAGGGA gAGGGGGCGG 840 

TGGGGTTCTA GGCGCGGGGC GTGTCTTTTC CCTCTCTTCT TTTCTTGGGT TTTAGCGGTG 900 

TTTTGGCGTT CGGGGAGGTC GGATGGGTAG GAGTGTATCC GCCAGGAAGA GGCATGATCA 960 

GAGTGAGGTG CGTAGGATGC GTGGTAGGAT GGCTAGGTCT GCGGCGCGTA CTTGTGCGCG 1020 

GAGGTATTTG GCTGCTGTTA CATCCGGGGA TAGGGAGAGT TCTCTGCCTC TACTTAGGAG 1080 

CTTGGTGAAG CGACTTGACA CCGCTGCCCG GaAAGGTGTT TTCGCTAGAA AGGCTGTGGC 1140 

TCGCCAGAAG TCCCGAATGT GTAGACTGTA CAACGGTGTG TTCTCTTCAc CCGAGGTGGT 1200 
GCGCGTTTGA GGCGGCTGTT TGCCCGCGTG TGTTTCTTGT CGTGAAGAGA GTTAGGAGAA 1260

CGCGGTCTTT CGTTGTCGAT GCACTTTGTG ACGAGgTGGA TTTGAGCCGT CGCCATGTCG 1320 

CGAGGGTTGT TGATAGCTTT GTCTCTGTGG TAACCGCTGC ATTGGAACGG GGGGAGACAG 1380 

TCGAGCTGAg GGATTTtGGG GTGTTTGAaG TCTCGCGTGC GTAAGGCTTC CGTCGGGAAG 1440 

AGCATAAAGA CAGGGGAgGT GGTCTCTATT CCAAGTCATT GTGTGGTAGT GTTCCGCCCC 1500 

AGCAAGCGTT TAAAGAGTGC GGTGCGGGGA TATCGTTCGG GGGAGGTTGG TGCGGATTGA 1560 

GGAATGGTGT CGTTCCCGTC TGGGCGAGTT TTTGTTGTTT GTTCTGGCGG TTTCCCTGTT 1620 

CGCGCTCTCT CACCCTAACC CTCTGCTTCC CAGAGGGTGT GCTCTCCTAG CGTATGGGGC 1680 

GCTTGCTCCT CTCTTCCTTT TGGTAAGGTG GGCCTCGGGT TTTGCGGTTG TGTTCTGGGG 1740 

GGGTGCGTAC GGCGCGTTCA GCTACGGTGC GTTTTCTTAT TGGCTTTTTG TATTTCATCC 1800 

GGTGGCGTTG TGCGTAGTTG CCGGCTTCTC TGCGCTTTTT CTTGCGGCGC TGTGTCTTGC 1860 

GCTGAAGGCT GGTGGTGCAT TTTGGCAGCG GCGGGCGCTT CTCGTGCAGT GTCTTGTGTG 1920 

GCTTGGGTAT GAGTACGCGA AGACGCTTGG TTTTCTTGGT TTCCCTTACG GGGTTATGGG 1980 

TTATTCGCAA TGGCGTGTAC TGCCGCTTAT CCAAGTTGCA TCGGTCTTCG GTGTGTGGGT 2040 

TGTTTCTGCA TTGGTGGTTT TTCCTTCAGC GTGGCTCGCA TCTGTCCTGG GGCAGTGGGT 2100 

TGAGGAAAGT GAAAGGAATG CTCGGGCGTT TTTGTCTGCC GCGTATAGCC ACTGGGTTTC 2160 

GGCGCTGGTG TGGGTTGGTC TGTGTGGGTT TTGTGTATGC GCGGCCAAGG CGGGATGGTG 2220 

GCCGGATTGC ACAGCTCACA CGCGGGCAAA GGTTGCGCTC GTTCAGCCTA ATGGTGATCc 2280 

GCGACGCGGC GGTATCGAGT CATATCGGGC GGATTTTAGC ACACTGACGT ATCTTTCTGA 2340 

TTGGGCGCTT GAGCGGTATC CAGATGTTGA TTTGGTGGTG TGGCCGGAGA CGGCTTTTGT 2400

TCCTCGCATC GACTGGCACT ATCGCTACCG GCACGAACAG CAGTCATTTC AGTTAGTATG 2460 

CGATTTGCTG GACTACGTGA ACGCCAAGAA CTGCCCGTTT ATTATCGGTA GTGACGACGC 2520 

ATATAAGAAG CGCACGAAGG AGGGGAATtG GGAACGTGTT GATTACAATG CGGCGCTTCT 2580 

TTTCATTCCT GGGGTGAACG TGCTTCCGCC GAGTCCGCAG CGGTACCATA AGATAAAGCT 2640

TGTTCCCTTT ACGGAGTACT TTCCGTACAA GCGGGTATTT CCCTGGTTTT ACAACTTCTT 2700 

GGAAAAGCAG GATGCGCGCT TTTGGGCCCA GGGGAGTGAA TTCGTTGTGT TTGAGGCACG 2760 

AGGGTTAAAG TTTTCTGTCC CGATTTGTTT CGAGGATGCG TTTGGGTACA TCACGCGTGA 2820 

GTTCTGTGCG CGTGGTGCCT CTTTGCTCGT CAATATTTCT AACGACAGTT GGGCAAAGAG 2880 

TCTTTCCTGT CAGTATCAGC ACCTGAGTAT GGCGGTGTTT CGCGCAATCG AAAACAGGAG 2940 
GGCACTGGTG CGTGCAAGTA CGTCTGGCCA GACGGTTGCA ATTGCGCCTG ACGGGCGTAT 3000

ACTCGATGAA CTACAGCCCT TTGCCCCGGG AGTTTTGGTG GCGGACGTTC CGATTGTCAC 3060 

ATGCGCATGC GGAGGCTACC GGTATTGGGG GGACGCGTTG GGAGTCTTTT TTTGTGTGGC 3120 

GTCCCTTTTT ATATTGATTG CTGGTGGTGT GCGCCATATG CTGAGATGCA GGAGGGGCGG 3180 

GTGGCGTTGA AACGGGTTAG CGAAGGGCAT GGCAAGACTG TTCTGGGTGC GAAGACGGTG 3240 

TTCGACGGGG TATTGCGATT CAAAGGTAAC CTGCACATCA GGGGAAAGTT CTCCGGTGCT 3300 

ATCGATGCGC AGGGCTGTTT GACCATTGCG CCGGGTGCGG TGTGTGCAGT TCAGTACGCG 3360 

CGTGCTGTTT CTATTTTTGT TGAGGGGGAA GTGAGAGGGA ATCTGACGGT GGTTGATCGT 3420 

GTGGAGATGA GGGATGGAAG CCGAGTGTTT GGGGaTGTCA CTGCTTCTAG AATTAAAATC 3480 

TGTGATGGAg TTACGTTTGA GGGGTCTGTT TGCAtGACTC GGGAAGGGAa TGTTTCGAAG 3540

CGGGATCTAT TTTCTGTCCA GTCTGAGCAA TTGAAGGAGC ATCTGCGTCG TTAGCGTAGA 3600

TATGGTTGGG TCTTGACTGA ATGCCtAAAA GAGGCGCCAC AGTTCCTGTA TACACCACGT 3660 

GAAGTTAAGG GTGTCGTCTT CTGTTTTCCT GGTGTTCTAG TCTTTAGCCA ATTTAGGTGA 3720 

GAGTGTTCTT GGGCGTGTAC TCGTTGGACG TCGGTTTTTC TTTCCAGGGT TGTAGCGTGC 3780 

ACGGTGCTGC GTGCTGTTCA AACCGGTGTC GGTAATCTCG GTGTGTAAGT TATGAAAGTT 3840 

TCTGTTGGTA CCGTCGTC 3858 
(2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 878 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 

TCACCATATG GAAATCGCGG TTTAGGAATC ATCAATATTA CACAGCTgtA CGGGGTCGGT 60 

TCTATCAGGG AGCGGAAAGA AATACAAATG GTGGTTCAAC TTGAAGAGTG GAATTCTTCA 120 

AAGGCCTATG ATCGTCTCGG TACGCAGGAG CTGAACACTA CTATTTTGGA CGTCAGTGTT 180 

CCCCTTATAG AAATACCGGT AAGGCCCGGA AGGAACATCC CCATCATCCT GGAGACAGCT 240 

GCTATGAACG AGCGTTTAAA GCGTATGGGC TATTTTTCTG CAAAGGAATT CAATCAGAGC 300 

GTACTCAAAT TGATGGAGCA GAATGCAGCA CATGCACCGT ATTATCGGCC AGATGATACG 360 

TACTAGGGGG CTAAAAAACG TGCGGTGTAT GGCGGTGGAA GGAAAGCATA ATGGTCGTAA 420 
AAACGGTGCG CGTGCTTAAT CGTGCGGGCG TACATGCGCG TCCTGCGGCG CTTATTGTGC 480

AAGCGGCAAG TCGCTTTGAT TCGAAGATAA TGCTTGTGCG GGATACGATC AGAGTGAATG 540 

CAAAGTCTAT TATGGGTGTT ATGGCTATGG CTGCAGGGTG TGGAAGTGAG CTCGAGTTGG 600 

TTGTAGAAGG TCCAGACGAA gTTGCTGCAT TGTCCGCCAT TGAGCGGCTA TTTCAGAATA 660 

AATTCGAGGA AGAGTAAATA CGCTCTTACG TGTTAGAACG CCTGTGTTTG TGCTCTTTGC 720 

GTGATAGGGG TACTGTACAC TGAGATAGGG AAGGGGCAGA AGGGATGTCC GTCTGGCTTT 780

TTACCGGACC TGAAATAGGG GAGCGAGATA GTGCAGTTCA GGAGGTGTGC GCGCGTGCAC 840 

AAGCGCAAGG GACGGTGGAC GTACATCGGC TCTATGnG 878 
(2) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5819 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 

TCCAGTCTAT TAATnGTGGC CGGGAAnCTA GAGTAAGTAG TTCGCCAGTT AATAGTTTGC 60 

GCAACGTTGT TGCCATTGCT ACAGGCATCG TGGTGTCACG CTCGTCGTTT GGTATGGCTT 120 

CATTCAGCTC CGGTTCCCAA CGATCAAGGC GAGTTACATG ATCCCCCATG TTGTGCAAAA 180 

AAGCGGTTAG CTCCTTCGGT CCTCCGATCG TTGTCAGAAG TACAGGATCA CTCGCAGGCA 240 

ACATTTTGTG GnAAGCTCTG TAGGGAGATG GGATTGGCGG ACTGGAGTAA TCCTGCAGTT 300

GTGTTGGAGC GCAAGATTCG GGCCTTTACT CCCTGGCCGG GTCTATTCAC CTATAAAGAT 360

GGGGAAAGGA TAGCGATTTT GCAGGCGAGG TCGTGTGAGT CTTCGTTTGT TCCCCTCGCT 420

CCTGTGGGGA CAGTGCTTGC TGCAGATAAA AATGGGGTGT TTGTCCAGAC AGGCGATGGA 480

GTTCTGTCCC TTTTACAGTT GCAGCGCTCC GGGAAAAAAC CTCTGTTTTG GAGAGATTTC 540 

CTCAATGGTT CCCCTCTATT GCTGACAGGT AGGTTAGGGG TGTGAGTGAT ACACGCCAGG 600 

CGTGAGATTT CTACGCAACG CATGATGCGT ACCCCAAGTG TGTCTTGTTA CAGAGAAAGG 660 

GGAGGTTGGT TTGTCCGAAG AAATTCTCAC GATAGAAGAG GTTGCGCGGT ACCTGCGAAT 720 

TTCTGAACGT ACCGTGTATG AGTGGGCGCA AAAGGGGAAG ATTCCGTCAG gAAAAGTGGG 780 

CACCGTGTGG CGGTTTCGCA GGTCAGAAGT TGAGCGATGG GTTGACACTT GTCTTTCCTG 840 

TTCTCACAGA CAGAGCCATT CGGATGTTTT GCCCATTGAG CGGATCCTGT CCACCGATCG 900 
TATCCTGCAT CTTGAACAGT CTGAGCGTCG TCCGGCGCTC TATGAGCTTT CTGATTGCTT 960

GAGCACTGCA CCTCAGATTA AAAATCGTAG CGAGCTTGCG GCAGAAATAG TGCGGCGCGA 1020 

GGAGCTCATG TCGACTGCAA TTGGGTGTGG TATTGCAGTT CCTCATGTGC GCTTGTCTTC 1080 

TGTAACTGAT TTGGTTATGG CGGTAGGAAT TTCAAAAAAA GGTATTGCTG ATTTCGGTCC 1140 

TCTTGACGGA CAAGACGTAC ATCTTGTTTT TATGATTGCC GCTGCTACCA ATCAGCACCG 1200 

GTACTATTTG CAAACGCTTT CTTTTTTTAG TTCAAAATTG AAAAGGCCCG ATTTGCGGAC 1260 

GCGCCTCTTG CAGACTAACA CCGCGCTAGA AGCGTACACC GTGTTGACAG AGCAGTCTAG 1320 

TTTGTAAGAT TTAGAAGAGA GCAGGATTGT TCAGGCAGAG GGAAAGCATT GACCTATTTT 1380 

TTTGAAACGT ACGGGTGCCA GATGAATGTT GCAGAGTCTG CTTCTGTAGA GCAGCTCCTG 1440 

TTGGCGCGGG GGTGGACAAA GGCGGTAGAC GCGCAGACGT GCGACGTGCT GATTATCAAT 1500 

ACGTGTTCTG TGCGAATTAC AGCAGAAACG CGGGTCTTTG GGAGACTTGG CTTATTTTCT 1560 

TCTCTTAAAA AAAAGCGTGC GTTTTTCATT ATCCTTATGG GGTGTATGGC ACAGCGTTTA 1620

CACGACAAAA TTCAGCAGCA GTTTCCTCGT ATTGATTATG TAGTGGGTAC GTTTGCGCAC 1680

GCGCGATTTG AATCCATTTT CCAAGAAATT GAACAGAAGC TTACCCAGAA AGATTACCGC 1740 

TTTGAGTTTA TCTCCGAGCG TTACCGGGAG CATCCTGTCT CTGGGTATCG TTTTTTCGCT 1800 

TCTTCATATA GCGAAGGTTC ATTCCAAAGT TTTATCCCCA TCATGAATGG CTGCAATAAT 1860 

TTTTGTTCGT TTTGCATTGT GCCATACGTG CGTGGACGGG AGATCTCGCG TGATCTTGAT 1920 

GCTATTTTGC AGGAAGTGGA TGTGCTCTCT GAGAAAGGAG TGCGGGAAAT TACGTTGCTC 1980 

GGACAAAATG TTAATtCGTA TCGGGGAAGA GACCGTGAAG GgAACATAGT TACCTTTCCC 2040

CAGCTGTTGC GTCATTTGGT TCGTCGTTGC GAAgTCAAAG ATCAGATAAA GTGGATCCGC 2100 

TTTGTTTCCA GTCACCCTAA AGACCTTTCT GATGATCTGA TTGCTACTAT TGCTCAGGAA 2160 

TCTCGTCTGT GTCGTCTGGT GCATTTGCCA GTGCAGCATG GGGCGAATGG AGTGCTCAAG 2220 

CGGATGCGAA CGGAGTTACA CGAGAGAGCA GTATCTGTCG CTGGTGGGTA AACTGAAAGC 2280 

GAGTGTCCCC AATGTGGCGC TGAGCACAGA TATTCTTATT GGGTTCCCGG GGGAGACGGA 2340 

GGAGGATTTT GAGCAAACGC TGGATCTCAT GCGGGAGGTG GAGTTTGATT CCGCTTTTAT 2400 

GTATCACTAT AACCCGCGCG AGGGAACGCC TGCCTATGAC TTTCCCGATC GTATCCCTGA 2460 

TGCAACGCGG ATTGCGCGTC TACAACGCGT CATTGCTCTG CAGATGAGTA CTACTTTGAA 2520 

AAAGATGCGC GCACGGGTAG GAAAGACATT GCCAGTGTTG GTAGAGTCGC GCTCGCGAAA 2580 

TAATCCTGAA GAATTGTTTG GACATACAGA GCTTGGGGAA ATGACCGTGC TTGAAGGAAA 2640 
GGTGGATCCT ACGTACATCG GACGCTTTGT GGACGTGCAA GTGAAGGAAG TGCGCGGCAG
GACCTTGCGT GCCCATCTGG TGCAGGAGCG TGCAAAATGA CATATGGAAA GCTGATTTTT 
TTTATTATCG TACTTGTGGG TTTCGCGCTC TTCATGTCCT TCAACGTGGA ACACCGCTGC 
GATGTATCGC TTGTCTTTTA TACtTTCAGG CAGTGCCGAT CACTTTGAGC TTGCTTTTTG 
CCTTTGCGTG CGGTGCGCTT ACGGCGTTGC TTTTTCTTAT TGATCCGGAC GCGAAAACAA 
GAAAACAGAA ACGTGAAGAC AGTCCTACCT CTGCTCCTAC AGGCGGCGTT TCTTCTCCGG 
AGCATGTGGA CGTTCCCTAG CCAGACTGCA ATGACACAAA GTCGCGTCTA GGGCTCGCAG 
GACGGCGCGC GTGTGCGTGT TTGGGTTCTC TGCTTAATGC GTGCAGTTTT TGTCCGATAC 
ACAGCGCATG GTGCTGTCGC GCGCGGTGTG CGCGTCCTTT TTCTTCTTCC ACGTAGCAGT 
TGCCGCGTAT ACGGCGCGTG TCCAGGAAAT GGCGATGCGT GGTTTTGCAT TGCGCAATTT 
TCAGCAGGTG CATGCGTATT TTGAGCAGCA TATTCCGTTG CTTTCTTCGT TTACGGAGAA 
AAAGGAAGCG ctCTCGCTCT TTGCTCAGTA TTTAGAATTG CACGATGCTC ATGAGCGTGC 
GGCACATCGT TACCGAGATG CcGGCGTTGT ATGCcGCTGG GTACTGAGCG CGTGCAGTTC 
TTACTTGAAr CTACGCGTAA tGCAATGGCC cgcGGATGCG CGCGAGTATG CACGGGAAAC
GTTGGCAGAA GTCGAGCACA TAGGTGTGCA GGTGCTAAAC AAGAAACAGC ATGCTACGTT 
CTTGGTTTAT CACGTGTGGC TTGCGCTCCA TGCGGCGTCT ACGGCCGCGC ATCTCCATGA 
GCAGTTGGAA AGATTGGAAG AGTATGGCAC GCAGGGTGTG TTCAATGTGT TTGAGACGGT
GTTGCTGTTT ACTCGTTGGT GGATTACTCA GGATGAGAAG GTGGCACAGC GTCTGACAGA 
GAGGTATcCG CAAAGCTTTG AAGCACTTTC GGTTATAGGG GCGGTGGAAA TAGCGCCGTC 
GGTTTTTTGG CATTTGATGC CGCGTGCGTA CGGAGAAGCA GTTGAATCAA TGGGAAAATC 
TGAGACAGTT GTCTTGCAGG ACGCGAAgCT ACGTCCTGTA CCCGAGGTGG TGGCAGCGCA 
CAGGACCCGT CGCGCGCACG TGGCCGCAGA CGGCACGGcT GCGCGGTCTG CTATGTCGTC 
GTCCCATAAT TTGGGCGTGT CGATTCTCGA GGGAGGGGTA TCTGTGCCCG ATGAGGTGGG 
CGCGGGAGAT GAGAAGCCAC GGGGGTACCA GCTCGGGTTT TTTCGAGCAA AGGAAAATGC 
GCAACGGCTG ATGGACGATC TGGAGAGGCG TGGTTTTGGG TTCCAGCTGC ATACGGTCCG 
ACGTGCAGAC GCGGTGTACT ACCAAGTTTT TGTGCCGGAG GATGATTCCG GCTTTGTTGG 
TCACCGACTA AAAGATGCAG GATACGAGAC GTTTCCCCTA TTCTAGGGGG CCGGCACACA 
TCGGTGTTTT AGAATGAGTT CCTGTATAAG GTGGTGCATA AACGCGTGGG GAAGCTGTGG 
ATATGGGGAT AGCGTGGGGA AAACCAGGAA TAAACCCGTG GAATGCAATT GCTCAGCAAC 
GCATCAGGGC GAAGGAGCAC TAGCCGGACC GGCGTGATAT CTGGTGTATT GACCGTCCCA 4440

CATCGACGTG CTGCCAATTT TGAGCCGTGC TACGTCTTCA ACCGCCCTTA CGTGTATCAC 4500 

CTCGTGGGAT TCAAATGTTA CCTCAAGGCG GCGGCGGATG CTGGAAATTG CGGTGTTGCG 4560 

CGTGAGAAAT GACACCGTCT TGCGGTGTGT GTGCGCGCTA TCTACCAGGA AGATTTCTTG 4620 

GACTGATGCG CGTGAGAATG TGATCTCCGC GTGTTCTCGG TCAAAAAACA GCAGAGGATG 4680 

GATATCGCCT GAGTTCGGGG AGATTTCAGT TTGTATCCAC AGTCCTGCCA AGAGGTGCTG 4740 

GAGTGCTCCC TGTTCACCAT GTgTTGCACg kTTCCAAAGG GCTATGCGCA TTGCTGTTTG 4800

TACAGGCGGT TGTGTGTGTG TCTGTGCTnC TCCGCCTACA GGGGCCGGCG GGGCGTTTCC 4860

ATGTGACCGT GGGTCTGTCG TGGTGGACGA GCCGGACGTA TTGTCTGCCG TGTCAGACGG 4920 

GTGTGCGGCG GTGTCCCGCG CGTTGCTGGG AAGTGCAGCA GTGGGTAGAG CGCCGAAGGT 4980 

ATCATGCGGG GGAATTCTGC GTGGAGGAAC GTGACCGTGC TGAAAGCAGG AGAAAATATA 5040 

TAGAGCCCAC GCAGCGACGA ACAACGATCC GGCCACACTC AGTAAAGGTC TATTCACGGG 5100 

ACGCTTCCTT GCACGCAGTA CGGAGGCACC AGCCTAGTCA AGCGAAGGGG TATAGCGCGG 5160 

ACTACTCTCT TTTGCAGGAG GAGTAGGGGT CGGGCGTTTC GAGTGCGCAG CTGCGATGCT 5220

GCGATACAGC TCCCGCGCCG TGTGGGCAAC GCGGTCGTGC ACGGCCATAT CCAAGGTGAT 5280

TTCGTACCAG TTGTCGTGTT TTACGCGTCG GGCTCTAACA AGGTAGGGAC TTTGAAAGGA 5340 

ACGCTGTTTG CCACGGTACT TTGAGGTGAG CGGTGCGTCG TGCTCAAGGT CGGTAACTAA 5400 

CAAGAGAACC TTGTGTCGGT TGTTGTCTTT TCTTTTTTCT TGTATCTCCC AGACGGTATC 5460

TAAAGCGCGG CCGATGTCTG TGTAGCGACC GTTTGGGACA ATGGAATCGA CAACGGAAAT 5520

AATTTTATCT CGGTCCTGCT CAcTGCGTAA GGTGAGGGTG ATAAGTTCCT CAGGCTTTTC 5580 

GTAAAACTGG TAAACGGTTA TCCAGTCGCC TTGGATGGTC ATGGAGGAGA CGAACTCATC 5640 

GCGCACCCAG CGGTGTAAAC TGCTGAACTT TCCTGGTTCT TGCATGGAGC GTGATTTATC 5700 

TATCATCAGG AAGATGTCGA CGGGGACAGT GCGTTCACtG CATGCAGGCA CAGGTGGATG 5760 

AGAAAGGTGC AGAGTGCAGG ACAAAGCGCT TTTTTCAGGT GCATTAGATA CTCCTTTAT 5819 
(2) INFORMATION FOR SEQ ID NO: 35: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25187 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35:

TGTGGCCTGG CGCGCTGCCT CATCTGCCTG CGCAAGCTTG AGCgCAAGCG TCTCGTTTGT 60 

CTGTGCACTT CCTACAGCAG GAGCTTTTGC ATACGCGTGA TCAwTACACG CCAGTAATTG 120 

TTGATGCGCA AGCAAATCCA CATAGCGACG CAAAGGACTG GTCACCTGAC TGTACTGAGA 180 

cAGCCCGAgT GCTGCGTGCA CCGCGGCGGT TGTGtCACGC GACGAGCTTT CATCGCGCGC 240 

CGCTTTTTGT ACTCCCCCGC CAATCCCGCT GGTAtTGCAC GGGCAGCTGA GGACGTTCCT 300 

GACTCACATA AGGAAAGGCA AGGTTATGTA GAAAGGCAAA CCGTGCTGCC GCTTCTCCCG 360 

CTAAGAGCAT GAATTCACGC ACCATGCTCA TAGACTCGTA CGACTGCTGT GCTTCAATGT 420 

GAACACGCGG CACCTTTCCC TGTGTATCGG GCACGTCCCC GGCGTTCATT TCCTTTCCCG 480 

CTTGCCCTGT TTCTTGCACA GGAAAATCTA CCCTCATGTG GACATCAGGA AAGCAAATGT 540 

CCACTGCGCC GCGCCCTTTT CTCCGTGCAA TGTTGTTGCG CGCAAAGTCA AAAAGAGGCT 600 

GCAACGCGGG GGTATCGCGC TGGGAATCCG CCTCCGCATA GGAAAGGCGC GTAACACGCA 660 

CCATGCTCCG GAGCACGTGC ACACAGCTGA TGTCACCGTG CTCATCAAGT AAAATTTTAA 720 

AAGACAGTGC AGGAGAAACT GCGTCGCGCG CGAGTGCACA CGTATCAACC ACCACGTCGC 780 

TGAGCATGCG CACTGCGCCT TCAGGCAAAT AGAGCGAAnA CCCCGTGTAC GTGCGCATGC 840 

ATCTGCGTGC GAATCAGGAA GAACGAGCTC TGCAGGGCTC GCTACATGGA TCCAAAAATA 900

CGTACCATCG AAACTGATCG CATCGTCAGG GTCGCGCGTA CCCTCCCCAT CGATGGCATA 960 

CGCGGCAAGA TGCGTACAAT CTGTGCGTGC TTGAGTGACA CACCTATGTG TGTGCACATC 1020 

TTGCTGCGCC GCACGTTCTC CATCAGAACC AGACACATGA GCAGGACTTA AAAATAGAGG 1080 

ACACAGACGA TCAGGATACG GATTGCGATA CACCGGCCAG AACCCGAAGT GCAAGAGTAT 1140 

TTCATGCGCT TGCTCCCTGT GCTCGCAGCC TAAGGCACAC TGCAAAATCT TACAGCGATT 1200 

CGCGTGCCCC AACGCGAACG CTTCGATTTC CTGCAGAAAG GGCGTAAACT GCTCGTTCAC 1260 

CTGCGACACG TGCACACGAT GCATTCCTTC CTGCGCAGTT GGTGCCATAT TGCGTGCACT 1320 

TCGTGCGACG CGCCGTAGCT CCTGGATGAA CGCCTGCTTT AACGCCTGAC GCGCTTCCTT 1380 

TTTTTCTTCC TGCACTGCGC AGGATGCGCA CTCTTGCTGA GAGCGGATGC GCACTGCGGc 1440 

GGCAGGCTCA TTGCAAACAA AATACGCACT TTGCACACTT TGCTCCCAAT AGGCCCACGA 1500 

CTGCGCGGCA GAAGCCCCCC AGAGGAGCTC TGCAAGTTCA AAAAAGGAAG GAGCTTCGGT 1560 

ACCGAAAAAC TCGCGCGCGT CCTGTACGGA TTCCTCGCTT ATACGCGCAA TGTGAGACGC 1620 

AGACAATAGT TCTACCAGGG AAGAAACAGT GCCCGGGTGC AAGAGCAAAA CGTCTTTCAC 1680 
GCGCACGcGT CTTAACCCGT GTTCGGTTTC AATGGTGATC TTCGCATCCT TCCCTCGCTc 1740

GATtAgGTGT ACACACGCAG GGCGCTTTCG ATAGAGCACC GGACTCCCCA CATGTAGTTC 1800 

CATCACGCAC tGCGCCTGCA GTATGCGCTG AATAGTTTTG AGTACGCTGT CTACCGAGTG 1860

ACTGCAGACA GGCGCACACC ATCCgTCGAA CACGCTTGaA CCACCCCATA CGCACACGCT 1920

CAGCACATGC ATATGCCGCA GCATGCAATT GCCGAACCGG CATATCTACG CTTCCTACAC 1980 

GATGGAATAC GAATCCGCCA AAACCTGTTT TAAAACGGTA CAACCCGTGC ATTGGGTGAC 2040 

GCACATCGTC CGTTGGCGGA ATACCGTAAA AATCATACCA AAGACAGnCc GCGCGCACGC 2100 

GCTTCTTGAA TTGCATACCA TTGCAGCGCA TACGGTGCCA TAAGATGGCG TGCTGAATAG 2160 

TCAGAAGCTC CATACACATA AGTTGCGCAC GTGTCAAAAC ACAACAATAC CAAAGCTGCA 2220 

ATTGCCTGCT CATCAGCGCA CCCTAATTCT CTGTCTTCCG ATGCCGGGGG ATGGGGTGTC 2280 

TCTATGTTCT TCGGTGTGTC TTTTCCTGCA ATACGCACCC GCAGTGCTGC ACGCGGAGCA 2340 

TAGGCAAGAC AGAGCACCAG CATCCCCTGT GCTGCAAATG CGGTGCAAAA ATCGCGATAA 2400 

TATTGACGGG TGTGGATGGC AATGCGATCA CGCGCCGCAG TTTTTTGGTA CAGCGCGTAA 2460

AACACATCCA CCGCCGCGCG CAGACTACCC GGAGAACCCT CCTGCGCGAG cGTATCAAAA 2520 

CGCGCCaCAC GCACACCGTG CTTTTGCGCA CGTCGAACGT TGTAGCGCCA TTTTGGTTTG 2580 

AAAGCAGCAA AAATATCTTC CCgcGCGGGG CGCATATCCA ACAGCAATGT ATCCTGAGGC 2640 

TGCACGTTAC AAGCAGCGCG CCGTAGTCCA CACGCGTGGA GCTCTCGCGT AAAGAGCTCC 2700 

ATCTCTGTTC CCACCGCGCA aTGCGTAGAG GAGGATGCAA GAGAAGGGAG CGAGCACACC 2760 

GCAGCAGCCC ACCCCCACGG GGGATCAAAC CGCACGAGGA ACGGtTACGC ACGAAAAAGG 2820 

GAAGTAGCGC GCTCGTTAAC TCACGTAGCA GACTGGCACG TGCGCGCGCC ATCTGCCGTG 2880

ACGGAATCTG ATCGTCCTGA AGATACGGGG GAGCACCCGG CGCATACGCA AACACGCCAA 2940 

AGGGCTTAAT ATTCTTGCAC AGAATGAGCA GGGGAAAGTG TTTTTCTCCC CCAGTGTTTG 3000 

CATCCGGGCG CACATGCACG CTGAACACGT ACGTCTGCCA GCCGTACGCT CGCTTGAAGT 3060 

GCGCCCACGC AGGACTTTGT AAAAACGTTT CTGCAGTCCA CGTCTCCTGC GTCCACTTTT 3120 

GCACGGTAAC TACGAACATG GGGCACCCAT TGTACTGCTC CCCGTGCACC GGATCCAGAT 3180 

ATCTCCCAAA AAGCTCCATT ACCTGCCGTG CGCTCCCGGT ACGCTCTGTA TGCAGAGGGA 3240 

TACGCTCTCT CCCTCTTGCA ATACATCCGT CCCTTACCCC CACACACGCA GGGGCATgCA 3300 

CAaTGCTAAG AAGCACACAT GAGCACCCTg ACCGTTCACC GAAGAACATG CACAATGGgC 3360 

GAGCCTGTGT GTTGCGGTCG AggTCCGAAG CGCACAGTTC TTGCGCAGAA AGGAGCGCAC 3420 
CCTATGGCAG TGCCCCGAGC AAATACyTCA AAAGCAmGCA CCCGTAGAAG GCGTGCGGTT
AATATGCGGC TTGAgGCCCC GCATCTTGTT GAGTGTGGGA ACTGTGGTAA TTTTGTGCAG
TCTCACCGTG TGTGTGGTAG GTGTGGCTTC TACCGGGGGC GCCAGGTGAT TAACCCTGAT 
GACCTTTGCT AGTGCCCGTG CGAGTGTGCA CCTGAGCGAC TGCCTTTTgC TCGCGCACAA 
GGAGGCTGCC CCGTGGATGA GTTGTTCTTA AGAATGAGGG CATTAGTGGC AGAGAAATTA 
GAGGTGGAGG AGGCGTCCAT CACGCTTGAT TCCTCCTTCC GAGGAGATCT CGGTGCTGAT 
AGCCTAGATA CCTACGAGTT GGTCTATGCG ATCGAAGAGG AGATGGGGAT TACTATCCCC 
GACGAAAAAG CAAACGAGTT CGAAACAGTC AGAGATGCGT ACGAGTTCAT CAAGTCCAAA 
GTGACATGAG CCTGTGTCTC GGTCATATTT TTTCCCGCTC TCGTTCTCCC CTCACCCCCG 
AGCGTAGGGA GTCTCTCCGG CGCCTGCAAG AGACGCTCGG CGTTAAATTC CGCGATCCTA
CCGCACTCGA CCAGGCACTT TCTCACCGGT CTTTGTTTTC CTCAAAAGAG GACCATTGCG
GTGTGCGCCA CAATGAGCGC ATGGAGTTTC TCGGGGATGC CGTGCTTGGC GCGGTAGCCG 
CCGcTTGgCC TGTATCGCGC ACTTCCCGAC AGTCACGAGG GGGATTTAGC AAAGACTAAG 
GCGGTGCTCG TGTCTACTGA CACCCTCTCG GACATTGCCT TGAGCCTGCG TATAGACCAC
TACCTTCTGC TAGGAAAAGG GGAGGAGCTT TCAGGAGGTC GGCACAAAAA AGCCATCCTT
GCCGaCGCTA roCGaAGCTGT CATCGGTGCG CTTTTTTTGG ATTcAGGkTT CAAGGCGGCA 
GAGCGTTTTG TTCTCCGtCT CCTgCTCCCC CgTgTCCGCC CCaTaCGAGA GAAAAAtTTG
CACCATGACT ACAAATCTAC CCTCCAGGTG CTTGCACATC AGCGCTaTCG TAGTAAGCCG 
GAGTACACGG TCGTCAAGCG CACCGGACyT GATCACAgCG TACGCTTCTG GGTGGATGTT 
ACCGTTGGCG ATGCACGCTT CGGACCCGGT TATGGCACCA GCAAAAAAAG CGCAGAACAG
TGCGCCGCTC GCCTTGCATG GGAACAATTA TCCGGCACCC TCCGGGAGTA GCGCGTATGC 
TGCCCTGTAA GaTACTCTCC TTGTCCCGCT CTGACACCGC CCGCCCCTTC GTAAAATGGG 
CAGGAGGAAA GCGCGCCCTC GCCCCAACCC TTTTTGCGCA TATGCCACAG ACATTCGGCT 
CCTACTTTGA GCCTTTCGTG GGAGGGGGAG CGCTCTTTTG GCACTTGTGC GCGTGTACTC 
GGGTGCGCCT ACACGACATC TATCTATCTG ACATAAATTG GCCACTGCTG TGTGCGTATG 
CAGCCGTTCG TGACCGTGTA GAAGAACTTA TCGTCCGGGT TGGACAGCAC ATCGCCTGCC
ACACCCCTAC CTATTACCGT CTTGCGCGGC GTAAATTCGC CGTATGCGAG CATCCGCTCG 
AGGTTGCCGC GCTTTTCCTG TACCTGAATC GGAGCTGCTA TAACGGACTG TACCGTGTCA
ATAAAGCAGG TCAATTCAAT GTGCCTCTCG GACGCGCTGC ACCTGCGTCT CCTTTTCTAA 

ATACCACCGC GCCTACCCCT CGCAGTACAC AGCCTGCGGC GCAGGTCGGa CACCTTGcAA 5220

TACGCATTGA TGAGGAGAAT TTACGCAGCT GCGCGCGTGC GCTAGCAAAC ACCACTCTTA 5280

ACTGCCAACA CTTTTCTTGC ATTCAACCTG CACGAGGAGA TTTTGTGTAT CTCGATCCAC 5340

CGTACCTTGC ACCTTCAGTG CCTATGATAA AACCGGTTTT GATAGAGCAG CGCACGAATC 5400

GCTTGCTGCG TTTTGCATGC ACCTAGACGC GCGGGGAGTT CTTTTTATGC TCTCAAACAG 5460

CGATTGCCCT GAGGTACGCG CATGGTATCG TCCATTCCGT GTGCAACAAC TCAACGCCCC 5520

TCGGTGTATC GCACGATCCG CTCACGcAAG GGGAAAAAGG TGCGAAGTGC TTATCACCAA 5580

TTACCCCTGC GCTGACACGG CTACACCGTA GCTTTCTGCA CTCTCCTGGC CGTATCGCAT 5640

CGCGTATTGC GGCGTTTAAT GCCACTACAG AAGTTTTACG GTCATAAAAA CCATCCGTGG 5700

GGACCCGCGT GCTCTGCGAT AATGCTTCGT ACACACTGCA CGTATGACGT AGTAAAAGAT 5760

ATAAAACGGT AGAAAAACGT AACAAATGCA GATACTATGC CCGCCATGTA CAGTCGAAGG 5820

GGAACCGTGC CACATCTTAC CTTTGAAGCG GCACTCAGAC ACTGTGCCCA GCACTTTGGA 5880

TCTCAAAATG CAGTCTGCTT CCTAGGCCAT GCTACGGACG CGCATTCGCG GTGCTGCTTG 5940

AACTACCGTC TCTTTGCACA GCGTGTGCGC CGTGCACGCC AGTTGCTGAT GCGCTGTGGT 6000

GTGCGCGCAG GAAGCTGCGT TGCGCTCTTT GGCCCCAACT GTCCACAGTG GGGAGTTAGC 6060

TACTTTGCAA TAGTAAGCCT TGGTGCCCGC GCAGTCCCTC TCGTACCAGA GCTCAGTCCg 6120

CAGaGCTGCG CCGCTGCCTC CAGCATGCTC ACGTTTGCTG TGTCATTGCG GGCGCTGCAG 6180

AAAGAGAAAC ACTCGCCCAA GCGGATACAC TCACCGATCC GGACGCTGCT TCTTGcTCCG 6240

CAAAAGACGG GCAGGACCTT TCTACCGTAT CGCACACCGC GCAAAGAACA CTGATCGCTC 6300

TGGAAGATTT CTCCCTTGTC TGCACAACGG ACGGTGTACA AAACACTCCA GTACCTGTGA 6360

CGCACTGGAA GAATGCTGGA TCAGACCCGG ATGCCATTGC CAGCGTGGTG TACACCAGCA 6420

CCGGAGGCGC TGGCACTCCT CcCCGTGCCG TAACATTTAC CCAACGGAAT TTACTGTGCA 6480

CCGCGCGATA TGCACAGCGT GTACTGCGTG TACGCACGCA CGATGTGGTT TTTTCGCTCC 6540

TCCCCCTTGC ACACTTATTC GAGTTCGTGT GTGCGTTTCT TGCAGTTTTT TTTACAGGGT 6600

GCCTGCGTGT GGTATGCACC TCCACTCCCA CAGATGCGGT TGCCACTGCA GCAATTGCAA 6660

TGCGTAAAGC CGACGTTGCT TTTCTGTCTC CCACCTTTTC TGGAGGCTTC CGAGCAGCTG 6720

CGTTCGCGCC GCCCCTGTGC TCGGCAGCTC CATACCCAAC TTGGAGGACA GcTGCGCCTC 6780

TTGGTTCTCT GGAGCGAAGA GTGCAGTGAA CACACGCAGA TTTTGCACCG GATCTCGCTG 6840

GAAGCGGTGC TCTTTCACGG GTATCTACAT GCGAGCGTGC TCATTTTTGT GACCGCAAAG 6900
AAAAATACAA AAGACAAAGA CATGCCGGCG CCACGCACTC AGAcGCGTGC
TTAAAAACGC GCCTTTTAAA CTACAACGCG TTCACCAGCA CCGGGGAACT 
GGAGAGGGTG TAACACCCGG CTATTGGCGC GATGAGGTAC GCACACGTGC 
CCCGACGGCT GGCTCCGTAC AGGGACGCTT TGGACAAAGA CAGAAACCGG 
CCCTGCAGCA GCTCGTGCCA TATGCAACTC GGTGCGCGCG GAGAAGCGGT 
GATCTTGTTT GTGTGCTTAT GCAACATCCg TGCGTGGTGC ACgCACACGT 
ACGCAAGGGC AAGCGCACTG CGCCGTATGG GTAAAACAAG GAGCCGAACG
CCTCAGATAG TGTGCTTTTT GCGCACCCAG CTTTCCACGC AGGTGCGCGT 
CGGGTGATGG AGTATGAAAA AACAGTAGGT GCTGACCGCT TGTCTCGACG 
TTCCTAAAGG AAAACTCTTT GTAGAAGGCG CAGCTCGCAC GTTAATGTAC 
TCcCCTGACG GAAAAAAAGG TACTCCACGT AGTGTTCTTC GATGGTTCGT 
GCATATTTAG GCGTAGTCGC GCCTGGGCAT CTCTTTTGCG TATTATAAAC 
AGGAGCACAG ATCAGCTACA CGCTGGTATG CGTGCGCCTG CGTGTTTTCT 
TTTCCCCGAT CAGGGCGTAT GCTTCCTGAC GGCTCAGTGC CGCAACCCCt 
TTGCAAGCGA ATGGCTTGGA TCTGCTCATG TGAAATACCA AACAGGAACT 
GGTACAAACT CCtGtTCGCT CAAGCGCTGC ACTAAAAAGT CGCGCCACAT 
TCTGTCGAGA GCAAAAACAA CTCACTAGAA CCGAGCATTG AGCCAAAGCT
TGCTCCCGTG CACACCACAC CTGATGCAGT GCCGACGTaG GCGCCGCGCA
GACGCTGTTC CCACAGCGCT ACGAGCGCAT GCGTAACAGC AGTCTTCAAA 
CGTGCTCCTG CTCCATTAGA GAGAGGTATA GATCCTCTGT CATGAGCATA 
CAAACTGCAC CTGGCGCAAC TCCTGCACCA GGTACTCAGC TAGAGCTACC 
CCTGCGTAAG AAACGCAAAG GTTTGGATTT TAGCAATCAG GTACCCACGC 
CACGTGCAGG TAAAAAGAGG AACCTTCCTG CATCGGCCAG GTCTGTCACC 
CAAACCTTTC TACGTCCGCA CCATCTGCAC CTGAGTACGA ACGCATAGCA 
GTTTTAAATT AGACACGCGC CGGCGTACAA CCTGCGCACG CCGAGGCTGC 
CACACAGCGC GCGCGAGACG ATATACTGTT CATGTCTGCT AAGAGCGGTG 
CTGCAGCCTT TATGGGAAAC CCCGTCGTAG CCGCGCCCAT GGGTGAGTCT 
GACACGAATT ACTAGAATGA CCTCCCTACA AACCAAGGGA AAGGAAGGGC
AGAATGACAG GCGCACGCAC GGCAGGAATA ATCAATTCCA CACGTATCTT 




GGTGCCCGGT 

TGCCTTACGG 

AGCGTTTACT 

TAATCTCCTC 

GTACGCAGAG 

GCGCGTAGAC 

AATACGGCAC 

GGGGACGCTT 

CGACACACCT 

AAAACAGGCT 

ACAGGTCCAG 

GACCTATAAA 

CCCCCGCCCT 

GCGCACGCAA 

CCTCTAGTGC 

CTCGTCTAGC 

AGGGGCCAAG 

CGTTCATCCG 

CGCGCGGGTA 

CACACGAGTG 

CGCGTTACCT 

CTAAGCGCCA 

GTTCCCTTTC 

CGGACCAGAC 

GTGCgCAAAG 

AAGTGTATGA 

AATGTATTCT 

CTAACCGCTG 

CTCCTGAATT 

CAATCATATA TTTTTTACCG GCATACATAG GGCACCTAAA AAAGCGTTGA CTAATCAGTT 8700

ACGCGTGCAT ACACTCTCGG CATGGAGACA GATTACGACG TTATCATCGT AGGCGCTGGG 8760

GCCGCGGGAC TGTCCGCAGC GCAGTACGCA TGTCGCGCCA ATCTCAGGAC CCTTGTGATT 8820

GAGAGCAAGG CACACGGTGG TCAAGCATTG CTTATTGATT CGTTGGAAAA CTATCCGGGT 8880

TATGCAACTC CTATCAGTGG CTTCGAGTAC GCGGAAAACA TGAAAAAGCA GGCAGTTGCC 8940

TTTGGGGCTC AGATTGCTTA CGAAGAAGTT ACCACTATCG GTAAGCGCGA TAGTtTTCCA 9000

CATTACCACG GGTACGGGAG CATATACGGC GATGTCTGTT ATTCTTGCCA CCGGTGCAGA 9060

GCATCGCAAG ATGGGCATCC CGGGGGAGAG TGAGTTTTTA GGCCGTGGCG TTTCCTATTG 9120

TGCCACCTGC GATGGACCCT TCTTTAGAAA CAAGCACGTG GTGGTCATTG GTGGGGGTGA 9180

CGCTGCGTGT GATGAATCGC TAGTACTGTC TCGCCTCACC GATCGGGTGA CGATGATTCA 9240

CCGCAGGaCA CTCTGCGTGC ACAGAAGGCC ATTGCAGAGC GCACACTTAA AAATCCACAT 9300

ATTGCCGTTC AATGGAACAC TACGCTTGAA GCGGTACGTG GTGAAACGAA AGTTTCCTCC 9360

GTTCTGCTTA AGGATGTTAA GACGGGAGAA ACGCGAGAGC TCGCGTGTGA TGCTGTTTTC 9420

TTCTTCATCG GTATGGTTCC CATCACCGGT CTTTTGCCCG ACGCAGAAAA GGATTCCACC 9480

GGTTATATCG TCACCGACGA CGAGATGCGT ACCTCTGTAG AGGGGATTTT CGCTGCGGGG 9540

GATGTGCGCG CTAAGTCTTT CCGGCAGGTT ATTACTGCTA CTTCGGATGG TGCCCTTGCC 9600

GCGCACGCCG CCGCGAGTTA CATCGACACA CTCCAAAACT AAAACTGCGC GTCTTTGCAC 9660

TTCGGGTGTG CGTTTTTTAT CCTTCGAGGG GAGGGTACTG TTCTCTCTCC CCATCCCCAA 9720

CTCTTTCTGA GGAAGCTTTG GAGCTCGCGC TGGCTGCGAC GGTGTGTCTT TTGCAAAAGG 9780

GTCTGGCGGG GTCGTGCGCA GGTGTGCACT CGATCTTCCA GGAGCCTCTC GGGCCATGCA 9840

CGCTTTCTGT CTCCTCTGTA CCTCCGTCGT GGGGGACATA GTGTCTCGTC TCGTTCGCAA 9900

CCTCGACGCG CATACCATTG ATGAAGCCCT TGCCTTCGTT AAGTCGCGTG AAGCATTCAG 9960

TGTCAGCTTG GCGGAATATC TAAAAGCAGC TAAATGTTCT TTTTCCACGC GTGGTACCGC 10020

TCCCTTTATA CGGGGCGGTA GCGTACTGTA CCGAGATGCA GAACCGTGTG CAGTACTCCT 10080

GcTCACGCGC GCAGGGTTGC TTTTGCACAA CAGTGAGCCA AACACAAACA GTGCGGCGAT 10140

CTACCGTGCC TGCAAACGGC TGGTAACCGC GCAGGTGCGT TCGATTGTAG GTACAGAAAT 10200

GCACACGTGC GTTATCGCAC GCTCGATTTC AGGTATTACA ACTCACGCGC AGCAGGAGCG 10260

ATATTACCTA CTGGTGCTGC CACTACACAC ACCACGTGTT GAATATGAGC AAAACAGTGC 10320

TCCATTGCAT ATCCGGCGCG CACAGCTAAA AGATATGCGA GAGCTATTTC CTCTCCATAT 10380
GCATTACAAA CGTGAAGAAG TACTGCCCGT AGGAAACAGT CCAAAACATA AGGCGACCGT
GCGTACCCTG CACGCACATC TCCGTACACG CGTGATATTC CACGCCTCAA TAGGAGGACA 
CATCGTTGCA AAGGCGCAAA CAAACGCACA TGGCTTTCAT TGTCACCAAA TTGGCGGGGT 
ATATACGGTG CCTGCATATC GCAACCGGGG CATTGCCACT GCATTGGTTG CAACGCTTGC
GTATAACCGA CTCGATATAG GAAAAACACC GGTGCTTTTC GTAAAGGTAC GTAACATGGC 
AGCGCGGCGC GTATACGAAA AGATCGGCTT TACGCTACAC GGATTATACC GCGTCATTAA
TCTATAGCGA AgCAAACAaT AAAGACGTTA AAAGAGAAGA ACACGGCAAC GAAAGGGAAA 
GACAACGTCG CGTGACCTCT ATTTTCCAAA AAAGCTACGT TGACCAGCCC TTACCCATCG
CCCAGGGCCA CGTGACAGAA CCGGAGATGC AAGCTGCGTT GCAATCCAtG CGTATAGGAA 
TCGTCGTCAA CTCCGTCAAG CCGTATTAAC AATGTCTGCA TACGCGGTGC ATCCTTTCGC
GCAAGACCGA GCACGCACTC CTCCGCCATG CGCACAAAAG CGGCAGACGC AACAGATACC 
ATAGCAGACG CACAGTGCGC GCTCGTCTCC CGGTGCACAA ACAGAAGTTT CCCACGCGCC 
TGACTGACAa AATGCGCAGT CAGCATGTGC ACCAATGCCA TATTTGCCAC GATCAAATCC 
ACCGAGATCC GATCGATACT GGAAAAATCA TCTCCCGGAT AAAGATCTAC ATAACTTTGC
GCGTCAAAAA CGAACACCGC TGTCTCAAGC ACAtGCCTAG tTTTCAaTCT GAAGCAGAnT 
ACGCGCAATG AAAAGGGAGA CGCGCGATTC CAATAAaCCA aCCCATCACG CACAGGCACA 
TCCCCACGAG CTGCCTCGTG TGCGCTCAAG CACACACGAC ACCCCTGATT ACGTAAAACt 
CTGCGAGACT ACGAGAAAAC TGAAGGCGAT TATCGCTGAC GAAAACTCCC GCGCCCATAC 
GCGGGAGTAT AAAACACATC CTGGCGAAGA TAAAAGCGTC CTCACAGCAG AAAGACCAGC 
AACCATTCCG CCAGGAAGTA AAGAAGAAAC GACACGGAGT CCTGCGTTCT CTGCCCCGTA 
CCCCAACTTC TTATAGAGTT CCTCGCACCT TACAAGCTAA CAACCAAACC GCTCAAAGGT 
CTTTGCCCCG TAGTAGCGCG CCTGTGCACC CAACTCTTCC TCAATGCGCA TCAACTGGTT 
ATAtTCGCCA CCCGGTCACT GCGACTCATC GAGCCGGTTT TGATTTGACC TGTCTCAAGT 
GCCACTGCTA AGTCTGCGAT AAACGCATCC TCTGTCTCAC CCGAGCGATG TGAAATCACC 
GCCGCGTAGC cTGCGTTCTG AGCCATACGC ACCGCGTCGA CAGTTTCTGT GACCGTGCCA 
ATCTGATTAA GTTTTATCAG AATCGAATTG CACGATCCTT CTTTGATACC TCGGGCCAGA
CGCCCAGTGT TGGTTACAAA AAAATCATCT CCCACAATTT GGACTTTGTC TCCCAACTCT 
TTCGTGAGCT GCACGTAACC TGCCCAGTCG TTTTGGTCAA GCGGATCCTC GATAGACACA 
ATCGGATACG TAGCAATCCA CTTCTTGTAC AGATCAATCA TTTCCTGTGC TGTGAACAGC
TTCCCCGGAT TCGACTTCCA AAACTTGTAC CCTCTCCGAT CTCCTTCATC GAATAACTCA
GAAGACGCAC AGTCAAGcGC AATACACACA TCCTTCCGCG GCGCAAGGCC GGcTTTTGCG 
ATCGCTTTCA TAATGTACTC AAGGGCTTGC TCGTTATCCA AATCAGGCGC AAAACCACCT 
TCATCACCAA CTGwACGTAr cTTTGCCGTC GGCGGCAAGC AGGCCCTTTA ACGCGTGGAA 
CACCTCTGCG GTCATGCGCA CCGCTTCGCG CATGGACGCA GCGCCGATGG GCATAACCAT 
AAACTCTTGA AAGTCAATTT TATTATCAGA ATGCTTCCCC CCATTGATAA TGTTGGCCaT 
AGGGACCGGC ATGCGAAAAG TGTGCACACC ACCGAGGTAA CGGTAGAGAG GAACACCCAG
AAAGTCTGCA GCAGCACGCG CACAAGCCAT GGAGACGCCA AGcATAGCAT TCGCACCAAG 
CTTTGACTTA TTGTCAGTGC CGTCCAGATT CCGCATCnsG TGATCTATCT cACCCTGGTT 
GAGCGCATCC ATACcTTCGA GCGTATCAGC AATGAGCGTG TTGAmAGTTC CAAcGGCCTT 
GAGAAcACCC TTACCGTTAT AgCGcTCCtT GTmtCCATCA CGCATTTCGA GCGCCTCGAA 
CTCTCCgGTA GACGCCCCTG AAGGaACACA CGCACGGCCA AAGCTACCGT CAnTGAGCGA 
GACATCCACT TCGACAGTGG GGTTTCCCCG AGAATCGATG ATCTCGCGCG CTTCAATGCA 
TGCAATGTCA CTCACTTAAG ACCTCCTGAT GTGGGCGCAT GGTAACACGC GAAAAAAAAT 
GCGTAAAGGT TTTCCACTCT CTCTATCGCC CCCGCACGCC GCGCTCCCTT CCCACTGAGA 
ACACCACAGA GAAAGTAACT GTACCAACCG CCTCGGTATT CCAACGGAGT ATTGCAaCGG 
CGCGTTATAT CTATTCTCGA TACGCAATGG GAACCATCAA TTGATCACGA GACAGAATAG
TGTGCGGGAA AACACGCTGC GCTTCCTTTA ATAAACGTTT CAACTCGTGA TCGGTATATC 
GAGGGCTATA GTGGATGAGT GCCATAAGTC GCACACGCGC ATCGcGcgCT aTCGTGGCTG 
CCTGCACGCA CGTCATATGC TTTTTCTCTG CTGCATCCTT TTCCATCCCT TTCTCAAACA 
TTCCCTCACA CACAAAGAAA TCCGAATTCC GCACCTCGGc TGCAATGGAC TGCAAATATT 
TTGTATCAGT GACGAAGCTC ACCTTACGCC CCGGACGCGC CGGTCCCATT ACCTGTTCAG 
GATATACTGT CACCCCCTGC GCGGACTGCA CTGCAACCCC TGACTGTAAC TGAGACCACA 
GCGCCCCACA GGGAACGTGC AAATCCTGAG CCGCGCGCGG GTCAAATGAT CCGGGACGAT 
CCTGCTCTTC TAGCGTGTAG CCCATACACG GCTTGGTATG ATCCAGACAA AAACAGCGCA 
CCTGAAAATC CTTACCACGG TATACCACTT GTGGTTCTAT CACCTCTTTG ACAATAATCT 
CGTAATTAAT GTACATGTCC AAAATCCTGC GGCTCGTTTC CACATACTCT GCAGTTCTTG
GAGGACCGAT GATGTACAGC GGTTCGCTGC GAGCAACTTG AGAAGAGAGC ATCAAAAGCC 
CCGGCAGCCC AGTGATGTGG TCTGCATGGG TGTGACTGAT GAAAATGGCA CTGATTTTCT
TCCAGCGTAA CTCAGACGCC GCAACGACAC TTGGGTACCT TCCCCAGCGT CGAACAGAAA
CAACTCTCCC TCACGACGCA ACAACACAGA AGTCAGATGC CGATGGGGTA ATGGCACCAT 
GCCGCCACAC CCTAAAATAA ACGCTTCAAG ATTCATATGC ACACCATTCC GACGTCAACC 
TTTTGGAGGT TAACTCAGGC GCCTGATGGA TTCTTGTGCC TCTTTGTACT CAGGACGCAA 
ATGCAACGCA CGTCGGTATG CATCAAGTGC AAAGCCTTTG TCTCCTGCGT ACTCGCGCGC 
AAGTCCCAGG CGATACCACC ATAGCGCATT CGACGGTTCA AAATTCACCG CTGTTGAGTA 
CGCGATGTCT GCCTTCCGGA AGCGTTTGGT GAGGCGAAAG ATCTCTCCCA AATAAAAGTA
TGCCACACTG ATACGCTCAC CGCGCGGCGC CATGTCAATA TAGCGCTCCA TAGCGTGCAT 
GGAACCCTTG TAATCGCCGA CAAAAAAGAG CGcCTCTGCG AGAGTTTCCA CCACGCGGTG 
ATCGACCGAA ATTTTCAGCG CCTCCTGGCA TAGGGCAACC GTATCTGCAT AACGACCAAG 
ACGGAAAAGA GACCAGGTAC ACACTGCATA CGCGTCGGCG TGTCGCGGAT CGCGCTCAAG 
CACACTGCGA CAAAGCTCAA CCGCCTGCGT ATACATCTTT TGTGCATCTT CACGCCCACC 
TGAAGTGTCC ATGtACGCCC ATTCCGGTAA AGAGAGAGTG CCTCTCGCAC CTCAGCTGCT
CCCGCCTGTG CAGCAGCAGG AGGTTGCTCC TGCGCACTGC CACGTGCAAG AAACCAGACA 
AGAmCGmaTG CCCCCGCTAC TGTCCGTGTT CGTGTAAACA CAGCGCCTCC TTCAGACACA
TCGAAGGCTC CCGCACAGAG CGCACGCCCT TTATGAAACG CGACGCGCAC ACGGGACACC 
TTCTTTCAAA AGACACACCC ACACCATCCC CATCCTTGAG TATGCAGAAG AACCGTCAGG 
ACTGGGTAGG TTTTAAACGG AAAGAACTTG CACCCTACAA AGCAGGCGCC ACCTCCCCAC 
CCTTGTAGCG TTCCTTCAGA TACGTACGCA CGCGCCCAcC ACACAACGCA CGGAGCACCG 
CCTGCACGCG CGCATCAGCC TCGTTTCCTC GTTTTACCAC CAGCACATTC GCGTAGGCTG 
AGGCATCAGG TTCCACTGCA AGCCCGTCAC GCCGTGCAGA AAGACCAGCC ATTATTGCGT 
AATTTCCATT AATCACCGCA CCATCTACCT GATCAAAGAC GCGCGGCAGA AGGGCACTTT 
CCACCTCCTG AAGTACCACA TTGCGCACAT TTTGCTGCAC ATCCTCTACT GTGGCAAACA 
GTCCTGAACC CGCACGCATC CGAATGAACC CTGCTGCTTC CAAAAGTCTG AGTGCACGTG 
CCTCGTTGGA CGAATCATTT GGAATGGCAA TGACCGCGCC GGCGGGGAAA TCACTCACAT 
GCCGATACGT TCTAGAGTAT AACGCCAGTG GCTCTACGTG CACGTTTCCA ACACTTACCA 
GGTCCCCGTT GTGCTCCTGG TTAAATTGCT GCATATGGGG CACATGCTGA AAGAAATTCA 
TCAGAATATC CCCCCGCATT ACCGCCTCGT TCAGCGCCAC GTAGTTTGTA AACTCTACAA 
TACGTAGTTC GATGTGCTGC TTCTTCACTT CTTCTTTTGC GATCTCAAGT AAGCGCGCGT
GCGGTTCAGA CAGCACCCCT ACCCCCACCG TTTCATCCTT CACCTGAGTA CACGCAACCA 15660

CCCCTACGCT TAGGGCAATG AGTTTCCCTA CGAGCGCAGC GCTcACCGTT TTTCCTTTCA 15720

TCTCATTCGG CCTCCCCCTG TCCCTCTATC CATCCTGTCA ATGTCCAAAG AAAAAGCGCA 15780 

CATCAAGACT CACCCTGCAT TTCACACCGT TTCCGCACGT GGCGCATCGC ACGCCCCCTG 15840

CACCCGTACG TCCACATGCG GAAAGGGAAC CTCAATGCCC GCCTGTTTGA AGCATTCGTC 15900 

GATATCCACG AAGATAGCAT TGCGCAAATC ATTGAAATGC TCAATGTGAG TCCAGGTCAG 15960 

GAGCGTTACG TCAATACCCG AGTCAGCGAA GGCATTCCAC AAAACCGCCG GCGCAGGATC 16020 

CGAAAGCACA AACCGGTTAC GTGTCGCAAC ATCTAGCAAG AGCTGTTGCA CCCGGCGCAG 16080 

GTCACTTCCA TACGCCACGG AAACTTCCGT TTTCACTCGG CGATGAGGAC AGTGCGAATA 16140 

GTTAACAAGG TTCGCTTTGA GGATCGTTTC GTTGGGCACG CGCACATACT GCCCATCGAG 16200 

CGTTTTGAGC GCCACCGAAA GCAAATTAAT TGACTGCACT GCACCGACGA TACCGTCAAT 16260 

TTCTATCACG TCTTGAATTC GAAAAGCACG TTCGGTCATG ACAAACAGCC CTGATATGAC 16320 

GTTTGAAACC GACGTTTGCG CCGCAAATCC AAGCGCTACT CCCGCTATCC CCGCGGCCCC 16380

TAGCAGCGCG CTCACGTTGA TCCCCAACCA GTGAAAGGCG GTAAACGTCA TCACCGTGAA 16440

CGAGAGATAG TTTAGTGTTT TGAACACAAA ATGCTGCGTC TGCGCGGATA ACCGCCTTGC 16500 

AACAACACGG CGCACACCGC GCCTCAGCAT TCGAAAGAAG GCAGAGGTGA TGCACAGCAC 16560 

GGcAACGAAG CGCAGGAGAT ACCACACGCG CTCTGACGTC GCAACCTGCG TGAGGCCCGC 16620 

GCCCAGcGCG CAAGCAGCCG TAGCAAGAGA CTGCACGAAA TGCTCCAATT CTCGCATATT 16680 

CCCCGCTTGT GCACGCCTCG ACACCACGCC TGCATCCTAC GCAAGGGCGT ACTGCGCGCA 16740 

AATATGCCCC CAATCTCCTG CTCCCCTTCC TAAAAACAGG AGAGCACTAC ACCCCTACTT 16800 

ACCTGACCAC ACCCCGTGCA GGTTACAAAA CTCATAGGCT TCGAGCACCT GATCGTCTGC 16860 

TGTCAGTGCA AAGGTCACCT CAGGCGCTCC ATCTACCGGA AGTTCCTTGA GCTGAATACC 16920 

CTTCCGGGTC TTGAGGCACA CCCACGCAAT GTAATGCTCC GGCGTCATTG GGTGAGCCAC 16980 

ACTCCCCACC TTCACCTTTA CCTCGTGTCC GTGCACTTCT ACCACGGGGA TATGCTTTTC 17040 

CTTCGCTGCA TCCACTGTAC CTACAGGCAC TGCACGCAGC ACCTCACTGC CGCACGCGAC 17100 

ACTACTGCCT GCAGGCGCAT CCATACCGAG AAAAAATCCC GCACTTTCCT TCTGCAAGAA 17160 

AAACGACAAC TCCCGTCCCA TGGCAAGCAT CTCCTATGTT GGTCTGATTT TGTTCTCGTG 17220 

CGGACGCCTG TCGTGCCTCC GCGTGCGGAA ACCGCCCGCG CCAGAGCACA GCCCCGCAGA 17280 

GGCGCTGATT CTACCTAAAT CAGGTCCGTG GGTAAAGCGT CACCCACCTG TACGCATTCA 17340
GTAAACTCGC TCCTTCCGAG CCTGTGCTAC GCACGAGTAT ACCTTGACCA TTTGGGTAGT
TTCCGCTACG CTCCTGCCAG CTTGGATTCC TGAGGAGTTT CTACGTGCCC TGCGGAAAGA 
AAAGAAAAAA GCAAAAGATA GCAACCCACA AGCGCAAGAA GAAGCTTCGC AAGAATCGGC 
ACAAGAACAG GTAGTCTCGG CCCGTGCGCC TTCTGTTCGA CATGAGGCGT TTTCGTCATG 
GATTTGTCGC TGcTTCGCTC CCTCACTGGG CCCCACGATC TTAAGAGTCT CTCCCCCGAG 
CAGGTGCGCG CGCTCGCgcA GGaGGTACGC CAGGAGATCT TGCGCGTTGT CAGTGCCAAT 
GGTGGTCATC TTGCCAGTAA CCTCGGTGTC GTCGAGCTCA CCATCGCACT CCACCGCGTC 
TTTTCCTGTC CCCAcGACGT TGTCGTTTGG GATGTCGGTC ATCAGTGCTA CGCGCACAAG 
CTCCTCACTG GACgCGCAGG GCGCTTCCAT ACCCTCCGCC AGAAGGATGG TATTTCGGGG 
TTCCCGCGGC GCGATGAAAG CCCGTACGAC GCTTTTGGTA CCGGTCACTC TTCCACGGCA 
CTTTCTGCCG CAAGTGGTAT CCTCAGCGCC CTACGATACC GGGGTAAATC AGGTAAGGTA 
GTCGCTGTCG TAGGAGACGG CGCACTCACC GCGGGCCTCG CcTTCGAGGC CcTCCTGAAT
GTGGGCCGTT CCTGCAGTGA TCTCATCGTC ATCCTCAACG ACAACAAAAT GTCCATTAGC 
CCCAATACGG GGTCCTTTTC CCGCTACCTG AGTACCTCAC GGTAAAAGGT CCATACCAGA 
AGCTCCACAA ACTTCGCCGC GCGCTCCAGA CTGTCCCACT CGTCGGTCGC CCCGCCTGCC 
GCGCCCTCAG CCGCCTGAAA CGAAGTGCAA GAACGCTTTT GTACCAGTCA AATATTTTCG 
CAGACTTTGG ATTCGAGTAC GTCGGTCCCT TAAATGGACA CCATATCGAA GATCTTGAGC 
GCGTACTCAA CGACGCTAAA AAACTCACCC GTCCCACTCT CCTCCACGTG CAGACTGTAA 
AGGGAAAAGG CTACCCCTTT GCGGAGCAGa ATCCTACCGA TTTCCACGGC GTAGGACCGT
TTAACCTTGC AGAAGGAATA GTAGAAAAAA AGGATGCGCT CACCTTTACC GAAGCCTTCT 
CCCATACCCT CCTAAATGCA GCGCGTACTG ATGACCGTGT TGTCGCTATC ACCGCTGCTA 
TGACTGGCGG CACCGGGCTT GGATTGTTTT CCCATATATA CCCTGAACGC TTCTTCGATG 
TTGGCATTGC TGAGCAACAT GCGGTCACGT TCGCCGCAGG cTTGCATGCG CCGGCGTAAA 
ACCTGTCGTT GCCGTCTACA GTACGTTTTT GCAGCGCGCC GTTGATCAGG TTATTCACGA 
TGTTGCTGTG CAGAATCTGC CGGTCATTTT TGCGCTTGAC CGCGCAGGTG CCGTACCCCA 
CGATGGGGAA ACACACCAGG GCCTGTTTGA TCTCAGCATT CTTCGCGCTG TTCCGAACAT 
AAACATCCTG TGCCCtGCGT CGGCGCACGA GCTTTCGTTG CTCTTTGGCT GGGCGCTTGC 
ACAGGACACC CCCGTAGCTA TCCGCTATCC TAAGGCGTTA TGTCCACCTG AAGAAGACGG 
ATTCAGTACA CCTGTACATA CCgGACGCGG CGTCCTTATC ACCCGAGAGA ATGAGTGCAA
TGTACTGTTA GTGTGCACAG GGGGCGTTTT TCCCGAGGTA ACCGCTGCGG CCAACACTCT 19140

TGCGCGAAAG GGCATATTTG CAGATATCTA CAACGTGCGC TTCGTAAAGC CGGTAGACGA 19200 

AGATTACTTT TTAGATCTTG TAGGTCGCTA CCGTTCCGTT CTTTTTGTCG AAGACGGCGT 19260 

AAAAATCGGA GGAATTGCAG AAGCGCTCCA GGCACTCTTG AACACCAGGC ACCCGGCTCC 19320 

GTGCAGCGAC GTGCTTGCTT TTCAGGACAT GTTCTACCCG CATGGTTCGC GCGCGCAGGT 19380

ACTCGCCGCA GCAGGCCTTT CTGCACCGCA TATTGCCGCA CGCGCAGAAT GGCTGTTAGC 19440 

CCATTCAGTT GGGCAGATTC GGTGAACAGT ATGCATCTGC ACGCCGTTCG TTACAtCCGy 19500 

sTGCGctTGC AGCGCATGgG CAGATGGCCG CCATACAGCG GAAGGAACGG GGAGGCCCCG 19560 

CCTGCTCACG CCAGGCGCCG GGGGACCGCA TCCGTTTCAA TCGGCGCACA CGCTGCCTGG 19620 

AGTCAGAACA TCGTGCTATT TCTTAGAAGT ATGGTCCTGT GGTACGcAGC GTACGTTCGT 19680 

CCGCTTTTGG ATGTCGCGCT CCTTTCCTTC CTCCTGTACA AGACATACGA GATACTTGTT 19740 

AAAACACAGG CAGTCCAGTT GGTGAAAGGC GCCTTCTCCA TTCTCGTACT CTACGCTTTG 19800 

GTTTTCGTAT TAAAATTAGA AACGCTCCTT TGGATTCTCA ATGCAACTGC CCCGGGCGTG 19860 

GCTATCAGCA TTACTATTGT GTTTCAGCCG GAATTAAGAA AAATTTTTTT GAAAATTGGA 19920 

GAGAAGAACT GGCTCCGACA GCGCGAATGC GCmACCATAC GCACATCGAC GCGGTATTAA 19980 

CTGCCGCAGA TGTTCTTTCT AAAAGGAAGC GCGGCATGTT GGTAGTATTT GCCCGTCACC 20040 

ATACCGTGCG CGAGGTCAGT GAAAcGGGTA CCGCGCTGTA CGCGCGCCTT TCATCCAGCC 20100 

TGCTTGTGAC TATTTTTGGC CACGATACCC CCATGcACGA TGGAGCAGTC ATtGTGCGCG 20160 

ATGGGCTCGT TGTCTCTGCA GGCTCCTTTT tGCCGCTTTC TGAACAGCAC GATATTAGGA 20220 

AAACGTTCGG CACACGTCAT CGTGCCGCGC TTGGTATGGC TGAAAAAACA GATGCCATTA 20280 

CCCTGGTCGT GTCAGAAGAA ACGGGCGCGC TCAGCCTTGC CTACGATTCA AAGCTGTACT 20340 

ACGATCTTCC GCACGCGGAC GTATTGGCGC AnTCAAACAG TTACTCGAAA CTACCACTCG 20400 

GGCTGGACAC GCTCAAGGGA CACTGGATCA TGGTCGCAGC ACGTTGTCTT GATAGGATTG 20460 

CGCACAATTG GGCTGCCAAG GCATCGAGCA TACTGCTTGC GTTTTTGCTC GTGCAATTTT 20520 

ACAGCGGCAG TCTGCTGGAA CGGCGCGCCA TTTCTGTTCC GTTAGTTGTG AGAAATGAAG 20580 

GCGCACTAAC TCCTGCGCTT CGCTTTCCTC AAAAGGTGAC GGTGCTGATG CGCGCTTCAC 20640 

GTGATACGCT CGGCGCACTG CGCGGATCTG ACATTGTCCC CTATGTGGAT TTGTCCTCCT 20700 

ACACAGAGGA GGGAGAGTAT GCAGTTCCTG TGCGGGTGAC TGTAGCTGAC CATGTTGCGC 20760

CACCAGATGC GCTTGAACTT GTCGCAGATC CTGCCATCAT CCCGTTCAAG CTGGAGCGTA 20820

GTGTcACCAA AAATATCCCC ATAACCCTAT CGCTTGAGGG TGTCCCTGCG TACGGCTATG 20880

AGCTGCGGGA AGTCGACACA AATCCTTCCA TGGTGGAAAT TCGCGGTCCG GCCTCTTTGC 20940 

TCGTTTCCTA CACACAAGCa GTTACCGAAA CGCTCGACAT AACCAATAGA CGCGCGTCCT 21000

TCTCAGGTGT CATTGGACTT ATCAATCCGA GTACGCTCGT TTCTTTTCCA AAAACTAAAA 21060 

CGCAGTTCGT TGTCAGGGTT CGGGAAGTTT CTGAGCTCAA AGAGCTTGAG ACAACACACG 21120 

TCTCGTTCAC CGACTGTGCC CCTCACCTTA CGTTCAGCAT CGAACCGGTC ACCATACGTG 21180 

CACAGGTGCA GGTGCCAAAG CATGTAATTG AAGAGATGCA CCCAGAGGAG TTCTTCTCTG 21240

TTTCTGCAAG AGAAATTACT GAACCCGGAC GCGTGACCGC TCCCCTTATC CTCTCGCTGC 21300 

CCGAACACGT GCGTATGGTA CAGTACAGTC CCAAAGAGGT TCACGTTCAT GTGcGCGAAG 21360 

cGcakTCAGT CCCGGCGGAC GGACATGAAT GATCATTGGC GTGGGAATAG ACATAGTAGA 21420 

AATAGAACGA TTCGTATCTT GGACACACAA CGTGCGCCTG CTCCGTCGCT TCTTTCATCA 21480 

AGAGGAGATT GTAGACTTTT TTAAAAACCA CATGCGAGCG CAGTTTCTTG CCACGCGCTT 21540 

TGCCGCAAAG GAAGCATTTG GAAAGGCACT CGGTACGGGA CTCAGAAACA TGGAGCTAAG 21600

GAATATTCGG GTGTGTCAAA ATGGATGGGG TAAGCCGAGA CTAGAAGTCT ACGGTGCTGC 21660 

ACAGGCTATG TTGGCTGCAA CAGGAGGCAC GCATATACAG GTGTCGCTAA CGCATGAGAG 21720 

AGAAGTCGCC TCAGCCATCG TGATTATCGA GGGAGAACCG CTATGACCCG GTCATCTACA 21780 

AAGAAAACAG ACAAAAAAGA AAGCACTGTG TCTTTCTATT CAAAAGAGCG CATCGAGTGT 21840 

CCGGTGTGCA CAACCGTCTT CCAAAGAGAA GAAATGCATT CTGGAGGAGG TCGTACCATT 21900 

GCTGGTGATT TAACCGATGA ACTAAGAAGG ACATACGAGA CGTCCGCAAA GTATGGAGAG 21960 

GTATTTCCTC CCATTTACCA CGTGGTAGTT TGTCCCACCT GTCTTTACGC AACCTTTCTG 22020 

CAAGACTTTA GAAATATCGA GCGTGGGATT GTCACTAAAC TTTCTTCCAC CACATCACAG 22080 

CGCCGCACAT CAGTTGAGCG GCTCATTCCT CAGGTGGATT TTAGCGCACT GCGCACACTC 22140 

TCCTCTGGGG CGGCGGCTTA CTACTTGGCA ATACTGTGCT ATGACTTTTT TGATAAAAAG 22200 

TATTCTCCTA CCATTAAACA GGGGATCTGC GCGCTCAGGG CAGCATGGCT TTTTTCTGAT 22260 

CTTGAAAAAA AAGATCCGAA CGAGCATTAC GATTACATCC GCAATCTTCT ATACCAAAAG 22320 

GCACTTTTTT TCTATCGCAA GGCAATTGAG TGCGAAAGCc AGgCGAAGAA ATTATCGCAG 22380 

GATTAAAATC CTTTGGACCG GACACGGATA AAAATTATGG GTACGACGGG GTACTCTATC 22440 

TTTCGTATCT CCTTGAGTAT AAATACGGGA CCAAGCGCGA CAGAGCAGTC AGAAGGGAGC 22500 

GCATGCAGCG GaACAAACAA GGACTTGCAA AGATATTTGG CCTAGGAAAG TCTTCAAAAG 22560
AGAAGCCAGG TCCATTGCTG GAACTCGCCC GACAATTGTA CGAAAACCTG CTCGCAGAAT
TACACGAAGA CAGTGAAACT ACATGAATGA TGTGCGCAAA ATTCTCTTGC GTATTTCGTA 
CGATGGAACA CGATTTTGCG GATGGCAAAA ACAGGTCTCA GGCTCACGGG AACGTGCTCC 
CTCTGTCCAA GGTGAGTTGG AAAAAGTTGC TGAGAAAATT CACCACCAAA AGATAGCAGT 
CATCGGTTCA GGGAGAACAG ACTCTGGCGT ACACGCAGTA GGACAGGCAG CACATTTTTG 
TACCCCCATG AGAAATATAC TCGCGTATCG CTTTATCCCT GCATTTAATT CGCTGCTCCC 
GCACTCCATT CGCATTACAG ACGCACGCGA AGTCTCCTCT CAACTCCACG CACGCTTCTC
TGCCGTCATG CGCACGTACC GTTACCACCT CCACTGCGCA CCCGTCGCAT ACGCGCACGA 
ACTGCCTTAC TGCTGGCACA TTGCGAGAAT GCCCGATATA CACTTGCTCA ATCAATATGC 
TGCAACACTC AAGGGAGAAC TAGACTGCAC AAGCTTTGCT GCTGCAGGAG ATAAAAGTGC 
GAGTAAATCG CGTTATTTTT ACGACACACA CTTTTCTTTC AACCATCGCG TACTGACCTT
CGAAATCTCT GCTAATGCCT TTCTCTGGAA AATGGTGCGC TCTCTTACAG GAACCCTACT 
ACACTGCGAA AAGAAGCGGT GCTCCGTGCG CGAATTCGTC CGCATTTTGC ACGCGAAAGA 
CAGGCGCtTG CAGGGCCCAC CGCACCGCCG CATGGGCTAT TCCTATGGAA CATCCGTTAC
CCCGAACACT TACTCCGTGC AGAATAGGAA CACCCTCGCA CGTGAACTGG CATCCACAGG
CAATGCAAGG TGGAAGACGT ATTAAGCATG CACGTTACAT CTCTTCAAGA AAAGGAATCA 
GCACCAGaCG CATAGCTGTT CTCAGCACTA TGCGCACCGC ACGCACAAGT TCAAGCCTTG 
CACACGCGTA GTCCgGTCGT GCTTCACACA GAATGGGACA ATCATGATAG AAGCGACTGA 
AGCTTTTTGA GAGTGTATAG AGATACCCGG TAATAACGCT CGGATCATGT CCCTGTGCAG
CGCGCGTGAC ACACGCAGGG AAACGTGCAA GCGCCTTCAC CAACTCCCAC TCAGCTTCGT 
GCGTGAGCAA TGCAGGGTCA CACCGGACTT CACGAGGTCC CTTTTGCTCC ACATCTTCCT 
GAACCTTCTT TAAAAGAGAA GAGATGCGAG CACCCATATA CTGTAAATAG GGACCAGTGT 
TTCCGTTAAA AGACAAAGAC TCTTCGGGGT GAAACACCAT ATCCTTTTGA GGACTGACTT 
GCAATAAAAA ATAATGAAGC GCGGCGATGG CAACATTCTC TGCAATACAC TGTGCGTGTT 
TCAGTGCATT TTCCCGTCCC TTTTTTGCAA TTTCCTCTTC TGCCGCACTG TGCAGACGAT 
CCAAGATATC GTCTGCATCT ACTACCGTCC CCTCTCGACT CTTCATACGC CCATGGGGCA 
AGTTGACCAT GCCATAAGAG ACGTGATGCA ACTGcTGCGC CCACGGATAA CCGAGCAACC
TAAGCACAAA GAACAATACC TTAAAGTGGT AGTTCTGCTC GTTTCCCACA ACATACAGCA 
ATTGATCAAA GGGCCAGTCC TGTGCGCGAA AAATCGCCGT GCCAATATCC TGCGTAATGT
ACATAGTGGT GCCGTCAGAG CGAAsnAACG CCTTTTTGTC TAAGCCTAGA GAAGACAAAT 24360

CCACCCAAAT AGAGTTGTCC TCCATCTGAT AAAAAACGCC GCAGcAnAAC CACGTCTAAC 24420 

CTCTTCACGT CCCTTGGTAT AAGTTTCGCT TTCAAAATAA AGTTTATCAA AAGATATGCC 24480

CGTTCGCTCA TATGTTTGTT TGATACCGCG CAACGCCCAT TCGTTCATTG TTCTCCACAG 24540

CGCACGCACG TGGGATTCTG CACTTTCCCA GCGCTGTAAC AGGTCACGCA CATCGTGcTC 24600 

TGCTTCTTCC GGGTACTGCT GTGCGTAACG GTTAAACTGC ACGTACCAAT CTCCCACAAA 24660 

GCGATCGGAC TTGATGCCGG TATGCGCAGG TGTTTTTCCA TGGGCGAATT TTTGATACGC 24720 

GCACATAGAT TTACAGATAT GTACTCCGCG ATCATTGATG ATATTTACCT tGAACACATC 24780

CGCACCACAG AACGCAATAA TACGCGAAAG GCTTTCCCCA ATCGCGTTAT TGCGCAGATG 24840 

ACCTACATGC AACGGCTTGT TAGTATTGGG ACTAGAGAAC TCAACCATGA TACGTTTGCC 24900 

CTGTAAGTAc TGCGTGTGGC CATAGCGCTC CCCcTGCGCA AAGATAGCAT CAAGCGTATG 24960 

CGCAGtACAC ACTCCTTATT TAAAAAGACA TTAAGATAGG GTCCTCGCGC CTGCGGGTGc 25020 

CATACGCACA CATGGACGTG TCTTCTTCAA GCAGTGTGCA CAACTGCTGT GCAAGCTGTG 25080 

CAGGACTCCT GCGCACACGC TTTGCAAAAA GGAATAGAGG aAAAGCTATG TCCCCCATAC 25140 

CCGGCTCCGG CGGCTCTTCC ATAACTAACT GCGCACCTTC GACCGGn 25187
(2) INFORMATION FOR SEQ ID NO: 36: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21170 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36: 

TGCATGAAAA TACAACCAGC TTGCTGCATT AAGAACGCAT CAGTACCGTA GCGCAGATAC 60 

GCACATGGCT TACCACGCCG CGTAGACGAA TGGTCCTCAA TATCATCGTG CAAAAGGCTG 120 

GCTGTGTGAA TGAGCTCTAC TACCGCGCTC AGTGTGTACC ATTCGCGTTC GGAAATTTTC 180 

TTCCTCTTGT GTCCTTGGAG TTTCCGATGC GTGCACTGCA CACACGCGTG TGCAAGTTCT 240 

GCAGAAAGTA TCAgTAACAA CGGTCTCCAG CGCTTTCCGC CGCAGCTGAc ACACGCGTTG 300

CACACGGTGC GCAGGACGCG CGTATCTTCG GACGcAAGGA CACGGAGCCG CGTTCCCATC 360 

CACGAGCGCG TAGtTGCGCa GGAAGTGCAT CGGCGAGTGC TCGTTCAATA TTTCTGAGTC 420 

GTTGAGCAAG TGCaCGTTTC ATAGTACCGT TATACGGCGT GTTTGCCTTT TGTGCGAGCA 480

CGATGTGTGG GAGCTGTCAG TGTAACACAT AGCATGGGGG GCGTGTAGGA AGGTGTGAAC 540

GTCTATAAAC GTGCTGATTT TTGGGCAAAA AAGGCAGCCG CCgCGGGGTA CCGCGCGCGT 600

TCGGTGTATA AATTAGCGGC GCTTGATAAA AAATACTCCC TCCTGTCGCG CGCCTCGCGG 660

GTGCTCGATT TGGGCGCCGC GCCAGGAAGC TGGACCCAGT ACGTGCTGGG CACCGCTGCT 720

GCGTGCACCG CCGTGTGTGC GGTGGATGTG CAGCCGATTG CGTCGGACAT TCAGGACGCG 780

CGTTTGCAAC GAGTGCAGGG GGATTTGTGC GCAGCAGATA cACGTGCGCG TGTTGCGTGC 840

AACGCTCCTT TCGATCTTAT TCTTTCTGAT GCCGCACCTC GTACGACCGG AAACCGCACA 900

GTAGATACGA GCGCTTCTGC GTGTCTTGCA GCAGGGGTGT GTGCGTACGT CAACTTCTTG 960

TCCTCTGATG GAGGATTGGT GTTCAAGGTG TTCCAAGGGT CAGAGCACCT TGCTATCCTT 1020

ACACACCTGC GTGCGCATTT CGGTGCGGTG TGTAGTTTTA AACCGCCTGC TTCTCGTCCC 1080

CGTAGTTGTG AGTTATATGT GGTGGCGCGT TTCTTTCGCG GTACGTGCGG CAAGTAATGG 1140

GTAAGAATAG GGAGCGTAGG CGCCTGTGGC ATGCAATCAG GCATTGATCC ATTTAGCAAA 1200

CTTGCGTCAT AACCTCGGTG AAATTATGAG CCGTAcACGC GCGCGTGTGT GTCTACCTGT 1260

TAAGGCGGAC GCGTATGGAC ACGGTGCGTG TGACGTCGCA CAGGCGGCGC TTTCGTGCGG 1320

GGTGCACTCG TTCGCCGTTG CATGTGTGCA AGAAGCGTCG CAACTGCGTG CGGCAGGTGT 1380

TCGCGCGCCG ATTTTGTGTT TAAGTACTCC AACTGCTGAA GAGATTTCTA GTCTTATTGA 1440

GCATCGTGTG CACACCGTGA TTTCTGAGCG CGCGCATATT GCCCTTATCG CACGCGCGCT 1500

CCGTCAGTCT GCTGATACGG GTGCCACGTG TGGGGTACAC GTAAAGATTG ATACCGGAAT 1560

GGGGAGAATC GGCTGTGCGC CGGATGAGGC CTGTGCGCTC GTGCAGATGG TGTGCGCAAC 1620

ACCGGGTCTC CATCTTGAGG GGGTATGTAC GCATTTTTCT GTCGCGGATT CTGTGCGTGC 1680

TGAGGACCTG CAGTACACTG AGATGCAACG TGCACATTTT ATGCATTGCG TACAGTACAT 1740

ACGGAAAAGT GGCATATCGA TTCCATTGGT GCATGCGGCA AACTCTGCAG CGCTGTTGTG 1800

CCATCCGCGG GCACACTTCG ACATGGTGCG TCCGGGATTG TTGGCATACG GCTATGCCCC 1860

TGAGTCTGTG CATCCTGCTG TGCGCAgTGT GTTCCTTCCC GTCATGGAGC TTGTTACCCA 1920

AGTCCGTGCA ATCAAAAAAA TACCTGCAGG CGCGTACGTT TCTTACCAGC GCTTGTGGCG 1980

TGCGCATACA GAAACACATG TAGGTATTCT GCCTATCGGA TATGCAGACG GAGTTATGCG 2040

CGCGCTGTCG CCGGGTTTGC AGGTGTGCAT TGGGGGGAAG TGGTATCCGG TGGTGGGGGC 2100

AATTTGCATG GACCAGTGTG TAGTGGACCT AGGTACCCCG CTGCGTGTGA CAGTTGGAGA 2160

TAGGGTGACA CTTTTCGGTC CTCAGGACGC AGGTGGCCCA GGACAGGGGG CAGATGTGCT 2220

CGCCTCGCAT GCAGGCACCA TTCCCTATGA GCTTTTGTGC GCGATTGGTA AGCGTGTCGA
ACGGGTGTAC ATCCGGTGAA TATGTTTTTG CAACGTTTAT AAAAAGAGAC AAAGGGAGGA 
AGGCGCGCAt GAATGTCCTC GGAATTGAGA CCTCTtGTGA TGAGACTGCA GTTGCAATTG 
TAAAAGATGG CACGCACGTG TGCAGCAATG TTGTGGcTAC GCAAATTCCT TTTCATGCGC
CGTATCGTGG CATTGTCCCA GAACTTGCAA GTCGCAAGCA CATTGAGTGG ATTTTGCCAA 
CGGTGAAAGA GGCGCTTGCA CGCGCTCAGc tGACGCTTGC TGATATCGAT GGCATCGCCG 
TAACACATGC ACCTGGGCTG ACCGGATCTC TCCTGGTAGG CCTGACGTTT GCGAAAACAC 
TCGCATGGTC AATGCACCTT CCTTTCATTG CGGTTAATCA CCTTCATGCA CACTTTTGTG 
CCGCGCACGT GGAGCACGAT CTGGCATATC CCTACGTGGG CTTGCTGGCG TCTGGAGGAC 
ATGCGCTCGT ATGTGTTGTG CACGATTTTG ATCAGGTAGA AGCGCTTGGC GCAACGATCG 
ACGACGCTCC CGGGGAAGCC TTTGATAAGG TTGCAGCCTT TTATGGCTTT GGATATCCGG 
GAGGCAAGGT AATTGAAACG TTAGCAGAAC AGGGnTGnGC gCGTGCCGCG CGTTTTCCGC 
TTCCTCATTT TCACGGAAAA GGGCATCGGT ATGATGTATC ATATTCAGGA TTGAAGACAG 
CAGTTATTCA TCAGCTCGAT CACTTTTGGA ACAAGGAATA CGAgCGCAcT GCGCAGAACA 
TTGCTGCGGC GTTTCAAGCG TGTGCAATCA ACATCTTGCT CCGTtCCcTT GCGCGCGCAT 
TACAGGATAC AGGGCTGCCA ACGGCAGTAG TGTGCGGAGG TGTTGCAGCA AACAGTTTGC
TCAGAAAATC TGTAGCGGAC TGGAAGCATG CGCGGTGTGT GTTCCCTTCG CGTGAGTACT
GTACAGACAA CGCGGTGATG GTTGCTGCGC TCGGGTACCG CTATTTGATC CGTGGTGATA 
GGAGTTTCTA TGGGGTAACA GAGCGTTCGC GCATTGCGCA CTTCAGTAAG CGCGGGGGAG 
ATCGTCTCGC TGCACAGAGA AGCGCTGCTT CTCAGCCTCT TTTTTGAGCA TGTGCGGCTC 
AGTCCTTGCT AGGCAGTGTC CCGTTACCTA GATGCTGTGC CGTTTGATGG TAAAAATGAG 
CGACGCGATG AAGCACGCCA ATGGCAGCAG TTCCAACGTG AAGCCCACTA GTGACACGCC 
TGGTACAGtG wACcGCGTGA TAGAAAAGCC TGCCGCACGC AGAAATGCAA ACAGTTGTGC 
AAGGTACATA CACAGTACGC CCGCGATAAA GCACACCGTC TCCTGGGTAC GGGTCTTTGT 
CCACGCGACA ATGGCGAGGA ACACAGCAAC CGCCGTTACC GCTAGGCGTA ACATCAGCAA 
AATAGTCTGT GCTCGAGAGA GCAGAGAGAG AAATTCATTC ATTCGTGGTG TTCCTTTTCC 
TGTTCTTGAA GAAAAAAAGT GCATAGCTGG GTATAGTGCT CGAGCGTAGG GAGTACTGAG 
GGTTCAGTGT ACAGGGGGGA GAGCAGGATA TGCGCGTCAA GCgcACGTGC GAACAaTGCC
TCATACCGTT CTTCAGAAAC AGTGGGAAAA AGATAAGGCC cTTCCGTGCG CCACCATGTA

CTCATAAGCG TTATAAGCTG TGCGCGGTCG CgCTCTTGCC GCGCGCATTT TTGTGCGCGg 4020

CGCACGCGGT TCAGACCTCT GCGCCGTTTT TTCGTGTGTG GCAGCGCAGG GACCGCGTCT 4080

TCCTGCGTGG TGTGTGAGGA GTCTAACAGA GCGTGAGGCA TGGCGCGTGC GGGCTGAGAA 4140

AGGGAACCCA CGCGCGCTAA GTCCCAAAAG GCACGAGCAA TCGCCTGCGC AATCGGAGCA 4200

AAGAGCACGT cCCCCGCAGG GGCGCGCGCG GCACAGTGGG CACGGAGCGC AACAAACGGT 4260

GCAGGAGAAG GAAACGGCGG CACAACCAAA AAGACATCGG TTTGCGGGAC GCACGGTGCG 4320

CAGGGGCGCC ACACTGGCAC CTCGCAGGGT GCGGCGCCGC AAGCAAGGTT GAGAAAACGC 4380

GCACACACTT CCTGCGCGCG TTCGGCATGT GCGTAGTTAG CGCACCTCGT GCGCAGGAAA 4440

ACACGTGCGT AGTGCCTGCT GCaGTGCGCA GAAGCACAGG TGGGTAGGGG ACCCCACAGC 4500

CCGCGGCTAA GATAACGCTT AAAGTCCAGG CGCGCGCGGC TGGCGTTCCG TCCAAGAATA 4560

GAAGCGCCGC CGTCAAGGAA TAAATCCGTG ATGCGAACCC CACGCTCAGT GTATAGGTAA 4620

AAGCCGCGCG CACGCCGCAC GCGTCCAAAC CTGCGAATAA TCTCCGCCTG ATAAGCTTCC 4680

ATGCTGTTCT CTTAGGGGAA CACTCCCCCG CTGTGCACCC TGCGCATACc CcCCCTGCGG 4740

GCAACGCCTC GTCAAACCTT TCTATCCCCG AAATGGACGA TACCGGACTT GAACCGATGA 4800

CTTCTACCGT GTGAAGGTAA CACTCTACCA CTGAGTTAAT CGTCCTGCGC GCAGCATAAC 4860

AGCGAGGGTG TTTTTGTGCA ATTGCTTATC TCACTCTGCA CTGTTGTCGT AGGATCCTGT 4920

AAGAAGCGTG CTCGATGCAG TATAGTGGCC CTATGCAAGA AGAGGTTAGT GAGCGTACCT 4980

GTCTGGTGAG TGATCCGTCT TTGTCCTCTG TCGCTGGTGC AGGAAGCGGG GTGGTGCAGG 5040

CGTACGTCGC GCGGCAsTTG CGCGCCAGGT GCACGCCTGC GTTGACTGTG GGATGCGTGG 5100

GATACGCCGG GCGGTACGCA GGTTTGCCGC TGATGTTATT GCGCGCGAAG CTCCTGAGCT 5160

TACTGCCGCC AAACGAGAGG CGCTGTTGGA TGCGTGGGTT CCCTCGCCTT CCTCTGAGCA 5220

CGTCGCTGTG GGTTCTCAGC CTGCGCAAGC GTGTGGTGGC GCGCCCCTTC CTGCAGACGT 5280

GCAGTACAGC ATGGTACTTC ACTTTGTGTG GTACGGTCTT GGTGTGCTTT CTGAACAGGA 5340

GAGGGTGCAG CTTGAACAGG CGGTGCCTCA TTGGCCCCAG GTGTATTGGA GTTGCTTTTC 5400

ACCACAGCTT AAGCGTCTGA TTAAGGCTTG TTTGACCGGG CTGTTAGATG TGGAAGCGTT 5460

CTGTGCTGCA GTGAGAACGC TGCTGGGGAT AGCGGGCATG GACACTGCGG CGGACGCGTC 5520

GATTCACCCT CACGAAAAGT AGAGGCACGA AGGGGAGGTC GAGATGAGGT GTTCGCACAA 5580

TTGGGACGAC CCACCGCCGC TTTTTGGCGC GGTGTCTTAC GGGATGCAGG AAGGGGCCGG 5640

GAGGGGTGTG CGGCGAGAGG CTCGCGACAC TCCCTGCAGA GGGACTGCAG AGGGACTGGC 5700

CACTTCACAG CCTGAAGATG GAGAAACGCG CGCTGCgcTG CAgsGGATTG ATCACTTGGA
CACGCAGCTC CTGCAGCTGG AGCGGGACCT TGCCCATTAC CTAGAGATGG CCGAATTGCC 
TGATCCCTTC TCAGAAAACT AACGCCCCAC CTCCTACTGG AGGAGGCGTC TCTTTCTCAT 
GATATCAAAG ACGCTCTCTG GACCGCAAAA GTGCCCGGCG CTCGCAAACA CAAACGTTAT 
GCCGTGCTGT AAACTCAGCC GGTAGAATTC CTCTGCCCaC GCAGCGCGCC GCGATGCAGC 
AATCTCTGCC aCGTACCGGC GGTGGAGTCC ACcTGCaGCG TCCTTCGTTA CCAGTGCGTC 
CAGTTCCGTG CTTACTCTTC CCAAAGCGGT TTTGTCGTTC GAGAGGTAAC TGCGCACGAG 
GGCACCGAGC CTGCCCTTAA AGTCCGCGGG ACTTTTCCCG AGAGCGATCA GTGCACGAAG 
CAGCGTTATC TGCTCCTCAC GATTGCCAAA GGAAAGCATG TTCAGGTGTT TTTGGATGCT 
GTCTAGTCCA AGAATTTTGC GGTTCCCCGC GCGCTGGTAC AGGAACGCTT CGATGTTCTT 
CCCGGAGTCC AGCTTTGTAT GCGCGATGAG TGCCTGGTAG AGTGCTACGC GCATGACCCA
CGGTTCAAAT CTTGAAAGCG TGTGCATGTC ATCTCCCAGG GTGCTCCGGA GCATTTCTAA 
CTCCTCCCGA GAAAGGGAAG AGAGCGTTGG AGCAGCGTTT TCCTGCTCAA GCATCCCGTG 
aAGCATACGC CTCTGGAGAA CGCTGGCAAA ATTTTTAATG TCCTCTGAAC CGAGCTCAGC 
GTAGAGACGA CTTGCAGAGT CAAATACATC AAGGATTTTG TCCTGAAAGT GCAGCAGCTT 
TTCGCTGCCA ACCGAAATAG TGCCCAAAAT ATATACAGAC CCTTGAGGAC CACGTATTTC 
CCAGAACATA CGTTCCTTGT GGGAGATAAG CGACGGCAAG GCACCGCGAG AAAGGCTCGT 
GCAGCACGAA AGGAAAGGGA GAATAAGGAG CACACACAAG AACACGATCG CACAGCGTGT 
GGCACACAGG GAGCAGCGCT TCAAAACGGT CCTCCTGAGC AGTGGAAATA CAGGACGCCC
GGTGGTATTC ATCGGGCCTA ATGCAGAGGA ACGCTCCTTT TCAGAAGGAC CCACGTGGTG 
CCCTTACCCC CGCGCCGTTC TGCAGGgTGA AAGAGTTCAC CCGCGTGAGG ATGGGCCTGC 
ACGTAGCGCT TGACCGAGGG AGCAAGGACA CTCCCACCCT TGGAATGGTG GCCCTTTCCG 
TGGACGATTT CAACCTTCTG GAGGAGCCgC TCACGCGCCT GCGCAAAAAA CGAATCAAGT 
GCACTGCGCG CCTCACTGCA CGTCATGCCA TGGAGGTCTA AACGCGCCTC AGGGACTGCG 
GTGCGCAgCT TCCTCsTTCC CCGTcGGGAA TGGATGGAGA AGGTACGCCT CTGcCGCGCA 
TACTCCGCAC AAGskCCTGC AGCGTCCTTG TCGAAAAGTC CGTaGCGcGC AAGCGCmACT 
TCCATCAGGG AAACACGCGG CGCAgCGGCA GCCTGCGAGG ACGGAAGcgC GCCACGGCGC 
CGTGCGCGCA CAGTCGCAyT tCGCACGkcc GCCCTTCGGG cTCCTTCCCA CGTACGCAAC 
GTCCGGGCAA ACGcGCTCTG gCCcTcAGAG CcTCCTCAAG AGGcAGAaTA TCcTTACGtC
TTTtcCCCGC naTGAGCGGg CGcATTCTTT CATAAAAAAC CTGTCTGTGC AATCAACCGC 7500

GTGAAGcgGC ATCCTGCGTG GTAGGAAGGG GAAGACAGGG AGGCGGTCAC GGTGCATGAG 7560 

GAGTGTAATT TTCAGGGCCT CACAGGGATG CTTGCGCCCC gGAGGGgaTG TGCGATAATC 7620

GGCCCCCGGG GAGGGCGAGC GGTGGAGGTG AGAGTTCGGT ACGCaCCgTC TCCGACGGGg 7680 

CTCCAGCACA TCGGGGGTAT TAGAACTGCT CTCTTCAACT TCTTGTTCGC GCGAGcgCAt 7740 

GCAGGCGTAT TTGTCCTCCG TGTCGAGGAT ACTGAcCGCA GTCGCTGCAC TGCAgyGTTt 7800 

GAGCAGAACC TTTACGATAC GCTCCGTTGG CTTGGGGTCT CCTGGGATGA GGGGGGAGGG 7860 

TGCCCAGAAA CAGCGGTGAA GCAGGGCGCG CGGGGGGATG GCCGCTCTGT TGCTCACGCT 7920 

GGTGGGgCCT ATGGCCcTTA CACGCAgTCT GCACGGACAG ATCTCTACCG CGCGCAGGTG 7980

GCGCGGCTCG TTGAGACAGG GCAGGCGTAT TATTGTTTTT GCGATGCGTC GCGGCTCGAG 8040 

CGCGTTCGTA AGATCCGTAC GCTCAACAGG ATGCCCCCCG GTTATGACCG GCATTGCCgC 8100

GAGCTCCTGC CTGAAGAAGT TCGGGAATGT CTCGCATCCG GGGTTCCACA TGTGATCCGC 8160 

TTTAAGGTCC CCTTGGAAGG GAGTACTCAT TTcCGCGATG CGCTGCTCGG TGATATCGAG 8220 

TGGCAAAATG AGGAGATCAA TCCAGACCCG ATTTTACTGA AAAGCGACGG GTTCCCCACT 8280 

TACCATTTGG CTAATGTGGT AGATGACCAT GCTATGCGTA TTACGCATGT TTTGCGCGCT 8340 

CAGGAGTGGG TTCCCTCCAC CCCGTTACAC CTTCTGTTGT ACCGTGCTTT TGGCTGGCAG 8400 

CCCCCGCTCT TCTGTCATCT TCCGATGGTT ATGGGGGCAG ATGGGCACAA GTTGTCAAAG 8460

CGGCATGGAG CTACTAGCTG TGATGAGTTC CGCAACGCGG GgTATTTGCC TGAAGCGTTG 8520 

CTCAACTATG TTGCAATGCT CGGTTGCTCG TACGGAGAAG GTCAGGATCT GTTCACGCGA 8580 

GAGCAGCTGT GTGCGCACTT TTCTCTGTCG CGTTTAAATA AGTCACCGGC TGTTTTTGAC 8640 

TATAAAAAGC TTGCGTGGTT TAACGGTCAA TATATCCGTG CAAAAAGTGA CGAGCAGCTG 8700 

TGTGCGCTCG TGTGGCCTTT CATTGCAAAC GCCGGTGTGT GTGGCCACAT TCCGGCAGAT 8760 

GTGGAAGCAG GAGCTGTGCG CACACGACGT TTTGCAGACG AGGCGCCGTG TGCGCCTACA 8820 

GAAGCGCAGS GTtCCATGCT CATGCGAGTT ATCCCGCTGA TTAAGGAGCG GTTGCGGTTT 8880

CTAACCGATG CGCCGGAGTT GGTGCGTTGT TTTTTTCAAG AACCGTCTCT CCCTGAACAA 8940 

GGGGTGTTTG TGCCGAAGCG CTTGGATGTT GCGCAGGTGC GCGCGGTACT GGTGCGCGCC 9000 

AGGGGCCTGG TGCACGAAAT AGTGAGTGCC AGTGAACCGG ATGTTGAGGT GCTCTTGCGT 9060 

GCTGAGGCAG AAAAGTTTGG AATAAAACTT GGTGATTTTC TCATGCCCAT TCGCGTTGCG 9120 

CTCACCGGTG CTACCGTGAG TGCCCCTCTG GTAGGAACTA TCCGCATCCT GGGGGCGTCA 9180

CGATCCTGTG CGCGTATTGA ACACGTCATT CGTGAACGCT TTTCGGATGA CAGTCAAGGA 9240

GTGGGAGGAG GCTGATATTC TCAGTTAACG CGGGCTATAG GGAAAAGAGG TATGCAGGGG 9300

ATGTGTTCAA AGGCGGGCGC GATCTCTGTG AGGTCGCCgC GGGTGCTGAC AGTATGCTAG 9360

ACAGGGGGGA AGGTGAGCGG CGCkTCACCA TGGAGAAGAT tGTCGGTCTC TGCAAACGGC 9420

GTGGCTTTGT GTTTCCATCT TCAGAAATTT ATGGTGGCCA AGGAGGTGTT TGGGACTACG 9480

GCCCTATGGG CATTGCGCTA AAAAACAATA TTGCCCATGC CTGGTGGCAA GATATGACAC 9540

GCCTACATGA TCATATCGTC GGGCTGGATG CAGCAATCTT GATGCATCCA AACGTATGGC 9600

GGACGTCTGG CCACGTCGAT CACTTCAGTG ATCCTTTGGT TGATTGCACG GTGTGTAAAA 9660

GTCGCTTTCG CGCGGATCAG GTTGCCGTGC CGTCTGCCGG GGGACCCTGT CCTCAGTGTG 9720

GTGGGGCCCT CACGGGCGTG CGTAATTTTA ACCTCATGTT CAGTACCCAC ATGGGTCCTA 9780

CGGATGAGCG TGCCAGTTTG CTCTACCTGC GTCCTGAAAC TGCGCAGGGG ATTTATGTAA 9840

ATTATAAAAA CGTCCTGCAA ACTACACGCC TGAAGGTGCC TTTTGGTATT GCCCAGATCG 9900

GTAAGGCGTT TCGCAATGAG ATTGTCACAA AAAACTTTAT TTTCCGTACG TGTGAATTTG 9960

AACAAATGGA AATGCAGTTT TTTGTGCGCC CCGCAGAGGA TACTCACTGG TTTGAGTACT 10020

GGTGTGCACA GCGCTGGGCT TTTTACCAAA AGTACGGGGT GCGTATGAAC CACATGCGTT 10080

GGCGTACCCA TGCTGCACAT GAGTTGGCTC ATTATGCACG GGCTGCCTGT GACATTGAGT 10140

ATGCATTCCC TATGGGCTTT AGGGAATTAG AAGGGGTGCA TAACCGTGGT GACTTTGACC 10200

TGACgcGCCA CGCGCAGCAC TCGGGTAAAG ACTTGTGCTA TGTGGATCCT GATCCAAACC 10260

TGGATGCGGC AGCGCGTCGG TATGTGCCTT GTGTCGTTGA AACGTCTGCA GGATTGAmGC 10320

GCTGCGTACT CATGTTTCTG TGCGATGCAT ACACAGAAGA ATATGTGCAG GCGCCGAATG 10380

TCGCGTTTTC GGAAACGACA CAGACAGCTG ATCAAGAAGG TGCTGCACGT ACGGGCGAGA 10440

TGCGAATAGT GCTGAgGTTG CACCtGCGCT TTCTCCCACC ACTGTTGCTT TTTTGCCTTT 10500

GGTAAAAAAA GACGGATTGG TTGACCTTGC GCGTGCGGTG CGCGACGAGC TGCGTGAGGA 10560

TTTTGCCTGT GATTTTGATG CaGcTGGCGC GATTGGAAAG CGCTACCGCC GTCAAGACGA 10620

GGTGGGTACT CCCTTTTGTG TCACAGTTGA TTATCAGTCA AAGGAAGATG ATACGGTTAC 10680

GGTACtCTGC GCGACAgCAT GGCACAGCGC CGGGTCTCTC GTGCCTTTCT TGCAGAGTTT 10740

TTGCGCACAG AGATAAAACA CTACCGGCGT CCCTAGGTTG TTGTCCGCTC TCTGCGCGCG 10800

GGGAAAATGT CACATATTAC ATCGCGAAGG AGCTCTCGTA TGAAAGCGTA TTCTTATGCA 10860

GTAGAGGATC GCTCGCTTCT CACTCCTTTT CTGTATCGCT TCTGTGTAGA TCCGCTGTTA 10920

TCCGGCGAAT CTCATTACGc TGTGCGCAAA CGCCTGTATG 10980

CTGCTTGCAT TTACCCATGC GTACTGCGGC TCGGTGGGGG GTACcTACGC GTATTGGTTT 11040

CTAGTTCCTG TGCTGTGTAT TGTGTACCTG GTCGGAGATT GTCTTGATGG GCGCCAAGCT 11100

CGGAGAACGG GAACTGGTAG CCCCTTGGGA GAATATTTTG ACCATTGTTT GGACACCTCT 11160

GTTGTAGGAC TGCTGGCAGG AATTTTCGTG CTCGCGTTTC GTATACGCGA GCCATTTCTT 11220

TTGACGTGTA TCTTTTTTGT TCCCGCGTTT GTGCAGATTT CAACCCTGTG GGAAAAGCTG 11280

CACCGCGGGG TGATGGTGTT TGCGCGCATT GGGTCAAACG AGATGGTArT GCTGACCACA 11340

CTCGGCGCAT ACGCTGGGTC GTTCGAAACA CTGCGTGCGC TGTTCCTCAC GCCGTTGTTT 11400

TTTTCCTGTA CTCCTGCACA GGTATGTGTA TCAGTGCTCT CAACGGGAGT GTGtATTTTT 11460

tCGTGTGCGG TGTTTTGGCG TATGCGAGTG TTTTCATGCG CACTTTTTTT GCATTTATCC 11520

CTTTTCTTCT TTCTCTGTGT ATTTTCAAGT ACGTATTTCC CCACGCAGAT TGGATATATA 11580

ACGGCACTGT GCACGTTATA TCACATGCGA TATGCAGAGC GCCTTCTGCG CGTCATTGTA 11640

CAGGGGGAGG GAACTGCCCG TGTTGAgGTG TTGGTGCCAC TTTTGTGCGG TGTGTTGTTT 11700

CTTTTTCCTC AGACAAGCTT TTGGGTGCAG CGGGCGCAGT GTAGTATTTT GGCACTTGAG 11760

GTGGGGGTGC ACTTTGTACG ATTTGTGTAT GCTCATCGCT GTTATTGGCA TTGGCTGAAT 11820

CCTCTTCCAA CACAGGAGTA GCGTGGTGCA TGTGACGCTT TTGTACGGAG GCCGTTCTGC 11880

AGAGCACGAT GTTTCTGTAC GTTCTGCACG TTTTGTGGCG CgCACGTTGT GCTTACAACA 11940

CACCGTAATG CTCATCGGTA TTACCCGTCG TGGCGTGTGG TATGCGCAgC CTGCGTGTGC 12000

ATTAGAGCAG TTGTGTACCG GCACTGTCGC GCTCAGTATT CAGGAAGATG AAAAGAGGCG 12060

CGTGTGTCTT GTCCCGGGAG GTGGTACTGC AGGCGCTTTT GTCATAGCGG GGATGCCGTG 12120

TGTCACGGAT GTGGTATTCC CCGTATTGCA TGGCAGTTAT GGGGAAGATG GTACGGTGCA 12180

GGGTTTGCTT GAGATGCTGC AGGTGCCGTA CGTGGGGTGT GGAGTGTGTG CAAGTGCTCT 12240

TGCGATGGAT AAGGTAAAGG CAAAGATGCT ATGGCAGGCG GCGGGACTTC CCGTTTTACC 12300

GTTTGTCTTT TTCCGTAAAG ATGCATGGCG TATGCATATG CAAGAATTTG TTGCGCAgCT 12360

TGAAACACGC CTTGGCTATC CTCTTTTTGT AAAGCCAGCT CAAGCAGGCA GTTCCGTAGG 12420

AGCCAGTGCA GTGCAGACGC GTGCACCGCT TATCCCTGCG ATTGAAGCGG CTTTTCAGTG 12480

GGATGAAGTG GTGTTGGTGG AGCGATATGT GCGCGCGCGA GAAATTGAAT GTGCGCTCAG 12540

TGGGAACGGA CCCTATACTG TACATGGGGC AGGAGAGGTG ATTGCGCAGG GAGCCTTTTA 12600

TGACTACGAG GAAAAATATG CTGATGCAAG TGTCGCGCGT GTACTCGTTA CGGCTCCTCT 12660

TGAGACTGCC CAGTACGAAC AGATTACCAC ACTTGCCCTG CGCGCATACG AAGCATTAGG
ACTCACGGGT CTGGCGCGGG TTGATTTTTT TCTGTTAGAA ACGGGAGAAG TATATGTGAA 
CGAAgTAAAC ACGATGCCGG GTTTTACGTC GATATCACTC TTTCCCCAAA TATGTCAGGC 
TGCAGGTGTT GCACCGCAGG ACTTAATGGC ACAACTCCTT TCTTGCGCAC GAGasCgctT 
TGCAGCGCGC GCCGCACTGA GCACCGACTT GCACGCCCAC GTGTGTGCGC CCTCGGTGAC 
TGCTGCACAT GACCCCGATG CGCAAGGGGA CGACTGGGAC CAGAGGswCT CGAACCCCCT 
CCCTACTGCT TAGAAGGCAG TCGCTCTATC CGGGTGAGCT ATGATCCCGT GGTACGCTGC 
GAGCAAAAAC CCTGCAAGGg TGGaTAAAAA TATATAACGT GTCAACAATC CTAGAATGCT 
GTGCTGTAGC TCCGACTGCT TATCGGGTGC ACCGTTTTTT GTTATAATGG CGCGCATGTC 
TTTTGTTCAT TTGCATGTTC ACTCAAATTA TTCACTGTTG GATGGAGCTT CTTCATTGCA 
GCGGCTAGTG CGTACTGCAA AGTCGCTGGG ACAAGAAmGc sTTGCgCTTA CCGACCATGG 
GAATATGTTT GGTGCGTTGC ATTTTCAAAA AGTTTGTTCT GCTGAGGGTA TCAAAGCGAT 
TATCGGATGT GAGCTCTACG TGGCACCCGA AAGTCGCTTT GATCGCAGTG AGCATACTAT 
CGGTCGCAGA TACTATCACC TCATCGTGCT TGCTAAGAAT GAGACGGGAT ATCGAAATCT 
AATGGTTCTA TCCTCCAAAG CCTATATCGA GGGTATGTAC TACAAACCAC GTGTGGATGA 
CGAGCTTCTG GCCCAGCATG CAGAAGGGCT CATTTGTCTT TCTTCTTGTC TTGCCGGACA 
GCTTCCTTAT CTGTTATTGC AGGGCAGAAA AAGGGAGGCA GAAGAACACG CGCGCAAATA 
CCGAGCGCTC TTCGGTGTAG ATAATTACTT TATTGAGGTG CAAGATCATG GACTTGATGA
AGAGAAGAAA GTAGCACCGC TTTTGATTGA GCTTGCATGT AGGCTCGGCA TTCCGTTGGT 
GGTTACAAAC GACGTGCATT ATGCGGAgcA GGnAAGACTC TGTTGCACAA GACATTCTGC 
TGTGCATTGG AACGAAGAAG AATCGCTCCG ATCCCAATCG GCTTAAATTT AAAACAGACG 
AGTTCTATTT AAAGTCTTCT GAAAAAATGG CTCAGCTGTT TCCCCACTAT CCTGAAATGG 
TGCTGAATAC GGTGCGCATT GCACAGAGAT GTAATGTGCG GATTCCTCAG CCTGGCCCGC 
TGCTTCCGCT CTACCAGATT CCTCATGAGT TTTCCAGCAA GGAACACTAT ATTCGCCATC
TGGTCCATCG AGGTTTGTAT GATCGCTATG CAGTAGTGAG CGAAGAAATT AAGGCGCGTG 
CTGATTATGA ACTAGATGTT ATCGTGAGGA TGGATTTTGT TGGCTACTTT TTGATCGTGT 
GGGATTTTAT TACGTGGGCA AAGGAGCATG ATATTCCTGT TGGTCCGGGG CGGGGGTCTG 
GAGCAAGTTC TATTGTTGCA TATGCGTTAA AAATTACCGA CATCGATCCC CTTAGATATA
AGTTGCTTTT TGAAAGATTT ATGAATCCTG AGCGTATTTC TATGCCCGAT TTTGACATCG 
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ACTTTTGTTT TGAGCGCAGA CAAGAAGTGA TTGAGTATGT GCGTGCGAGA TATGGAAATG 
ACAATGTTGG GCAAATTATT ACGTTCGGAA CACTTAAGCC AAAGGCGGcG ATTCGTGATG 
TAGGGCGCGT GTTGGATATT CCGCTTTCGG AAGTTTTGAT GATTACAAAA CTGATGCCTG 
ATGATCCAAA ACTGACTTTT AAAAAAGCGT ATGAATCTGA ACAATTAGCG CAAATGAAGC 
AGGAGCCGCG CTATGCTGAA TTGTTTCAAA TAGCAGAAAA GCTTGAAGAC ACCAATCGAA 
ACACTAGTTT GCATGCAGCA GGTATCGTTA TTGGTAAAAC GGCGCTCACT GATTATGTAC 
CGCTCTACAa GGATTCTAAG ACGGGAAAAA TTAGTACCCA GTTTGGTATG GATTTAATTG 
AAGACTGTGG ATTAGTGAAG ATGGACTTTC TTGGGCTAAA AACACTTACG CTCATCCAAC 
GGACGCAGAA TCTCGTACGA CGTAAAGGGG GTAAGTACAC AACGTTTTCG ATATCGGATA 
TCAGTGATCA GGATCCTACG ACTTTTTCTA TGTTGGCGGA AGGAAAATCT GCTGCaGTGT 
TTCAGTTTGA AAGTCGCGGT ATGCAAGGCA TCCTCAAGCG TGCAAAGCCC AGTAAGATGG 
AGGATCTAAT AGCGTTGAAT GCATTGTACC GACCTGGGCC GATGGCATTC ATTGATCAAT
ATATTGAATC GAAACGTGAT CCTGGGAAAA TAAAATACCC TGATCCGTGT TTGGAAGACA 
TCCTTTCAGA AACATATGGG GTAATAGTAT ACCAAGAGCA GGTTATGCAG GTGGCACAGC
GCATTGCAGG TTTCTCGCTG GGAGAAGCAG ATATTCTGCG CCGTGCGATG GGAAAGAAAA 
AGCTTGCAGT GATGCAGGAA AAGAAAAAGG AGTTTGCTGA GCGTGCAGAG AAACAGGGTT 
TTGATAAAAA GCATGCTGAG AATATTTTTG AAATTCTTAT TCCTTTTGCA GGGTATGGGT 
TTAATAAAAG TCACGCCACT GCATATTCAG TGGTTGCCTA TCAAACTGCA TTTCTAAAAG 
CAAATTTTCC CGCCGAGTTT ATGGcTGCGA ACCTTTCAAA CGAAATTAAT TCTGcAGAAA
AATTACCACT CTACATGGcT GAAGCAGAAA AGATGGGTCT GTCCATTCAG AAACCGGATG 
TCAATGCTTC TGAACCTTAT TTTAGTGTTT GTGAAGGGTG CATTGTGTAT GGGTTGTTGG 
GTATTAAAGG TTTGGGTGAG CAGGTTGCGT TTGACGTTTT TGATGAGCGT ATTCGCAACG 
GTCCTTACAC CTCCTTTGTA GAGGTGCTGG ATCGAGTTCC TGCAACCTCG TTAAATAAAA 
AAAATGCCGA AATAATGATT AAGGCTGGAT GTTTTGACCG GTTCGGGGTA ACTCGCGCAA 
GTCTTACAGC GCACCTCGAC GATGCAATGA AATATGTTGC GCGAAAAAAG GCGGTTACAA
GTTCTAGACA AGCAAGCCTT TTTGACGAAA CGGATTTAGG AGAATGTTCT GAATACACCT 
TTCCGGTTAT GGAAGAATGG TCCCAGAGGG AGAGACTCCG TATAGAGAAG GAACTGATGG 
GGTATTATAT TTCTGGTCAT CCTCTTGATG AATATCGAAG TGTGATAGGA GAAAAGGCGA 
CATTGGATTT AGGACATATT GAAAATGCTC GTTCTGAAAA TAAATACCTG ATTGTGGGAG 
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TTGAGGATCT CCATGGCTCT GTAGACATAG TTGTGTTTCC TGTGCTGTGG GAGGAGCATC 
GCGCGCAGTT CTTGCCAGAA ACTATTATGG GGTTGGTGGG AACTGTAGAC TTTTCTAAAG
AAACCCCGGC GTTCTTGGTA GATTCTGTCA TTGACTTGGA ACAATTACGG TTTGCTCAGG 
TTAAAACTAT TCTGGCTGGA TCGGAGCATA GGCGTGTATC GTCAGGAGAG AAAACTCCAC 
TGCAGAAACG TGGCGTTTCG CAGGAAGTGC ACATCGAGGT GAGTTCTCAC GTTCGTGCGC 
ATGCACAGTT TAAATCGTTG TATGAGATTT TGAGTGCACA TACAGGAGGC TCGGGTGAAG 
TGTTTCTTCA CATGCATGTG GATGACCGTA CGTACGTGGT GTACGTTCCT TCGTGTAAGG 
TATCTGCCAC TGAGGTATTT GCGCAncAAT nTAAAAGGTA ATGAGAGTTT TGTCCAAATT
CTAAAGGAGT GCGTGCAATG AGTTCTGTTC TATCTACACT CTCGGCATTA TTGTCAGTGT 
ATGCGCTCCT GTGTACGGCA CGCGTATTTC TCTCGTGGGT GCCCCATCTT tCACATTCAC
CCCTGGGGGA ATTCTtATCT GCGATATGTG AGCCGTACCT GTCCTGGTTT AGGAGATTTT 
CGTTTATGCG TGTTGGTACG GTGGACTTTT CTCCCATGAT TGCGATTGGG GTGCTCACCA 
TACTCTCAAA CAC TGTCGGA ACTATTTTCC TTGTCGGTTC GGTTTCTGTG TTAAAGTTAC 
TGCTGCAAAT GCTGATGCTG TTGCTGTTGC TGTGGTCGTT GTGCAAGTTT GTGTTGGAGT 
TTTTATTGAT TCTTTTTGCT GTTCGATTTG TTTCCGATCG TATGAATGTA AATGTTCATA 
CGTTATTTTT TGTGATGATG GATAGGATAT TAAATCCGGT ACG TGTTGCG TTGACCGCTC 
CGTTTAAGTT CCTTGATTTG AGTTACCGTG CGTCCTTGCT CTTGTGTGTT C TTGTG AT AT 
TGTGCGCGCG GGTTCTTGGA GGTTTTTTTG TGAATGTAGT GGTGCGGTAC TTTTTGACTG 
GAACACTGCA CGTGGCAGTG ATGTAATCCG TCGCTTTGAG ACAAAGGACT GATATCC CTA 
TTCACTGTAG GCAGTGTTAT TCGTCGAAAT ATGTATTGCC TGAAGAAATT ATTCGGGACG 
GAGGGATTTG AACCCTCGAT CTTCCGGTCC CAAACCGGGT GCCCTAGCCC CTAGGCCACG 
TCCCGTACGC TTGACACTGT GTGTTAAGAA TGGATAGGCT GTCAACGGTT AC CTGCG AAA 
AAGTCTCGAT TCTTGTGTGG GAGATTGGAT GGGCACGTTT GTGGTGTCAC TGCCTGGTGG 
GCGCCGAGAA AAGTTTTCCG AGTGCGTTCC AGCGCGCGTC CTCTTTGAGC GATTTTTTGG 
CACAGAATCG TCTGTGTATG GTTTGATGTG T AACGG T AC A CCGGTACTGC CATGCCAGGT 
GATAGGCGCC GACGCGGTAG TTGAGCCGGT TCGTGAGGAT ACGGTGTTAG GGGCCGCTCT 
GTACCGTAGg ACTGCGCGTT TGCTGTTTGC CACAGCGTTT CACTCGGTGT ATCCGCATGT 
GCGATTGTTT GCaGGGTATC GAGTGCmAGG GGGaTATTGC TACCGTACCG AGGGTGCGTG 




16200 
16260 
16320 
16380 
16440 
16500 
16560 
16620 
16680 
16740 
16800 
16860 
16920 
16980 
17040 
17100 
17160 
17220 
17280 
17340 
17400 
17460 
17520 
17580 
17640 
17700 
17760 
17820 
17880 






WO 98/59034 




CGCAGATGAC CTGGaTGTTT CGTTGGTAGT 
TGCGCCCATT CACATGCAGT ATATGACGCG 
GAATTTTCCA TATTCACATC ATTATATTCT 
GGTACTGGAC GGTTTTTCTG CGTTGTTTTT 
CACCGTCTTT GAGGTGCgGA TGTGTGCTGA 
ACAACGcCAC ATCATTTCTC AGCACAACGC 
GCATCGGCAG CAAGAAGAAC AGACAAAAAT 
TCAGTCTGGT GATGTTGCAA CTTGTGTTGA 
TGAGTGTTGT GCCACAGAAA TTGCACGAAG 
ACCGTCAGGT TCTGGAAAGA CAACGATTGC 
TGGTTACGAT CCGCATGTGA TTAGTCTTGA 
GTGTGACGCG GAAGGTAATC CTGATTTTGA 
TAATAAGTTG TTTTTGGATC TCTTGCAGGG 
TAAAACAGGG AAACGAGAGT ACCGGGGGCG 
TATTATTGAG GGCATACATG GCTTGAACGA 
TGTATTTCGG TTGTACGTCT CTGTGTTCAT
TTCGGCGTCt GAtGGAAGgT TGTTGCGGAG
CTGTCGAAAA AACACTTGAA ATGTGGCAAC 
TCCCTTTTCA GCACCGTGCA GACATGATGT 
TGTTAAAGCG CCGTGCaCAG GAAGTTTTAA 
GGGAAGTCCG CAATTTGCGT GCCTTGTTGG 
TTCCGGGTCA GTCGATATTA AGAGAATTTA 
AGCGAGTGCT TTTATAATGC AGGGTATGGC 
GGTGCTTCGA GTGTTATGAC GCTGTATGAA 
CGGGAGATAT CAGGACCTCC CTGTGAGAGG
AGAGTTCCCC TGTCTTCGAA TAGAGTGATC 
GCAGGTGGTC GGGGGGTAGT CGGCATATGG 
CTGCTCGAGT ACGTCTCGGG GCCTCTTGGC 
CGGGGCTCCA TAGAGCGCTG AAGGACTACT 




GCGTAGGATG AAGGCGCTTG 
TCGGGAAGCC TTGAATCTGT 
GGGTTCGTAC CGGACTGTGT 
TCAGCCGCTC ATGGCTTCTG 
GGGTTGTCTG TTGCGTTTCC 
GTCGCCACAG TTTGTGGTAA 
ATGCTCAGTA GGACAGTTGA 
CATGGCTGAG GCGGCGCACA 
GGACAGCGTG CGCGTGGTGT 
AAAAAAACTT TCAGTGCAGC 
TGATTACTAT GTGGGGATTG 
GTGCGTCGAA GCCTTAGATC 
GAAGCGTGTT GCACTTCCTT 
GGAAGTACAG TTTGGTGAGC 
TCGGCTCATC TCGTTGATGA 
GCATTtGTGC TTGGATGAAC 
GgTTgTsCGa CGcGCAGTTT 
GGGTGCGTGc AGGTGAAGAG 
TTAACAGTGC ATTGGTTTAT 
GCaCGGTTTC TTCTGCTTGT 
AGCAGTTTTG TTCGTTGTCT 
TTGGGCAAAG CGATTTTTGC 
GACACAAAGT GACGCGTGCA 
TATTATTTGA TATTTCCTGA 
AGTCTTCTTG ACATGAATGG 
GCGTACCGCG TCGCGGrAAA 
TACACGGmCG AGCAGCTTGA 
CAGCGATGAC GATTAACACG 
TTAGTCCTCG TGGTTCTCGG 



PCT/US98/13041

TGGCGCAGGA 17940 

TTACGCAGTG 18000 

TTTTAACGCA 18060 

TAGGGAGGCT 18120 

CTGAAGGTGG 18180 

TGTATCGGAG 18240 

ATGCGTGCAT 18300 

ATCGGCAGAT 18360

CGATAGCAGG 18420 

TGCAAGTACT 18480 

AGCGCACGCC 18540 

TTCCCCTGAT 18600 

CGTATAATTT 18660 

GTTCGCTGCT 18720 

ACCGGCGAGT 18780

AGCACAGGGT 18840

CGCGgTATTT 18900 

CGCTATATTT 18960 

GAGTTTGCaG 19020 

ACCACGTATA 19080 

GATGTGCATG 19140 

TATTGTCTGT 19200 

GAAGGGAAGT 19260 

TGGAGAATGT 19320 

ACATCCGTTG 19380 

GCGCACTGTT 19440 

CGCACTCGAG 19500 

TGTCGGGAAA 19560 

CAGGAAGTAG 19620 
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AGCTTCGCGG gTCGATCTGC GATGTCGTCC ATCCGGACGG aCGATTGTCG AAgTTCAAAC 
GTCGGGGCTA GGACGCCTGG AGGCAAAGCT GAAGAAGCTC CTCCCTTACC ACCAGGTGAT 
GGTGGTGTAT CCgGTCTCCA GACGTCTGTA TATTAGAATG CTGAACGAGG ATGGCAGCGA 
GCGGCATTAC CGCAAGAGCC CCAAGGAGGG TTCGTTCTTC CAAATATACC GGGAGATCGG 
CAGACTGCAC GACCTGCTCG ACCACGAGCA CCTTTCTCTC CATATCGTGT ACATACACAG 
CGAGGTCATC AAGGTCGACG ACCGGAAGGG GAGAAGTAGG TACAAGAAGC CGCGCATAGT 
CGACAGAAAA CTCCTCGAAG TGCAGAGCTC AGAAGAATTC CGCAACAAGG GGTCCCTCGC 
GCAACCTCTC CTGTCAAAGC TACCTGAAAT CTTCTGCTGC GATGACCTGG CGCAAACGGG 
CACAGGCGTG CACTGcCGct ACGCCcTGCG GTTTCTGAGG AGGAACGGGA TGGCCACCCC 
GCACTCGAAG CGCGGCAGGA CAAAACTCTA CCGGAAGGAA CCGCCGGGGG ACAATCGATC 
ACCTCCTCCC TGGCAAGAGC CACATGGGGA AGGCTTAGCA GAAAAGCTAA GCCCGGGCCC 
GGCCAGGTAG ACGCACTCGC TCATCCTTTA CCAGGCATCA GACATGTCAT CAGGCTCGCG 
CGATTCCACG TCGTAAAGGT CAGTGACCAC CCTCAGGTCA TCGAAATACA CGTAATAATT 
GCCATACGCC TCGAGAGGAT CGCAGTCTAC GCGGAAGCCT aCGATATTCA GCCCAGACTG 
GTTAGGGAAA CGGCGGCTCT TCTGTACGAT ACCCGTCTTC CCATCAACAT GCTGAGGGGG 
GATTGCGACA CTCATGAGCT TCCAGCCGGA AAAATCGAGC TGTCCCATGT GCAACTCAAA 
GCGCTGTCCC CAGAAATCCT CCAGCAAGAG ACTGAGCGAG TGCGGGTATC CGCGCCCAGC 
CACCCACACG CTCACTGTCT TTGCGACACC CTCAACGGGC AACGGCTTAA CGGAAGAAAC 
CTCAAAACTG TTGTACCCTC GCCGGTAAAA CGAAACtTCG CGCCAAACAC CTTAGAGTCG 
GGAATCTTCA TATCCCCTTC CTCGGGGATA GGGCGCTTTC TGGCAGGTCC ACCCTCAAAC 
AGGCGCCCCT TTATAGTCCc TTCGTCAGAA GACATGGAGA CAACCCAGGT CCCTTCATTC 
TCAAACTTAT CTACGGACAC TTCCTTGAGG CGTTGCGCAG CAGCATACAT CCCTATGCGA 
GATGGATCGG CAACATCCCT GCTCCCAGCC GCCTCCTGCG CATGGGCAGA AAAAACCAAC 
ACCCCTACAC TCACCACCGC TATCTTCTTC ATTTGCCGCT CTCTCCTTCC TCCTTGAACT 
GAGCATTTCT CAACTCGTGT CCATCGTAAA AGTCGATATG CATGTTAGCA AGCGCCTTGA 
ACTGGTCAAA GAACACGTAG AAATCATCCA CCCGCTCTGA TGGGCTAGTA 
(2) INFORMATION FOR SEQ ID NO: 37: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11516 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double
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(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37: 
ACGATGATAA TCCTCCTCCC TTCAATTCTG ACTTCGGCCT TTnCCACAAC GCACCCGCGT 
CATTGACTTG ATCTACTTTC TTCTGAGTAT CCGTTACCGC TCCATCCTTG TCGTACTCCT 
TCTGCAGGTA ATGCAGATAG ATGGGAAACG CGATATGATC AGAATACTTC TTAATTACCT 
CTTCAAGACG CCAGCGCGTT GCAAACTCGG AATTTTCCTG GCTCAGGTGC AACACAACGC 
AGGTACCGGC ACTACCCTCA GCAACCCCTT CAAGTACTGG GAAGGCAGCC GCATCAACCT 
CATCCAAGGT ATAGGCATTT TGCCCTTCAG ACGTCCACTT CCACACGGTG TTCTCTGCAG 
CTTTCTTGGT GATTACTTCT ACTTTGGAAG CGACCATGAA GGCAGAGTAA AACCCTACGC 
CAAACTGGCC TATCAGATTG GAATCCTGTT TTTGATCACG CGTCAGCGTA CTGAGAAACG 
CCTTTGTACC GGATCGCGCA ATGGTACCTA GATTGGCCCT CAGATCTTCT GCGTTCATGC 
CAATACCCGT ATCACGCACA ACAAGCCGTT GAGCATCTTC TTCAAACGCG ATGTCTATAC 
GCGcTTCGCA ATGCAACTGC TTGTACG T AC CATCAACAAG TGCCTCATAC TTCAACTTAT 
CTAACGC ATC CGACGCATTA GAGATAAGTT CCCGGAGAAA AATC TCTTT A TGGGAATAGA 
GAGAATGGAT AATCAACGTT AGCAGCTGAC TCACTTCAGT TTGAAACTCG TACTGAGCCA 
TGTATCCTCC CAGAGGTTAA AAAAGATTCC ATTACGCCGC GCACAGACCG CGCGCGAAGT 
GTAGCACAGA CTATGCAGCA CAGTAAACCA ACCGGAACAG GTGGTACACG CTGCCCGATG 
AACACCAGAC AAAAGAACCC GTGATTGTAT AGCGCTCACA CCCCATGGTA TGATGGGCAG 
GTCATGGATT ATCCGAGAAG GACTATAGCT TGTGGCGAGC TGCGCAGGTG CCACGTCGGA 
ACGGTAGTTG TGCTCAATGG ATGGGTCCAC CGAAAGCGGT CGCACGGAAC CGTTAGTTTC 
TTTAACATGC GCGATAGGTC CGGAATAGTG CAGGTTATAG TGAGCCAGGA GGAAAACGCT 
AGCCTGTGGT CCACGGTAAA CCGCATACGG TTGGAATGCT GTCTTGCAGT CGAAGGCGTG 
GTGCGAGAGC GACCTCCTTC AATGATAAAT CGCGCCCTGC ATACCGGGGA GGTGGAGGTG 
CACGCTCGCA CGCTGTACGT TCTCTCGGAG AATGCTGTGC TTCCGTTCCG CGTTGATGAT 
GTTGTGCATG CGCACGAAGA TATACGCTTA AAATATCGCT ACCTCGACCT GCGCTCTCAG 
CGCATGCAGG AGCGCATTGC ACTGCGCTCA CGCGTTGCCC TGGCCATACG GCAGTTTTTA 
AGTATGAAAG GTTTCATCGA GATCGAAACT CCCACCTTCA TCTGCTCTAC CCCCGAGGGG 
GcACGTGACT TTGTTGTCCC TTCCCGAGTG TGCCCCGGGC GTTTCTATGC CCTGCCACAG 
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TCCCCCCAGC TGTACAAGCA GCTTCTGATG GTGGCAGGGT TTGACCGCTA TTTCCAACTT 
GCCCGTTGCT ACCGAGACGA GGATGCACGA GGCGATCGTC AGCCAGAATT TACCCAGATA 
GACCTTGAGA TGAGCTTCGT TTCTCGAGAC GATGTTATGC GGGTGAACGA GGATATGCTT 
CGGTACGTGT TTAGAACCAG CATCGGTGTC GAACTGCCTA CCTTTTTTCC TCGGCTT AC C 
TACGCGCAGG CGCTAGACCA ATATGGAACA GATAAGCCAG ACATGCGCTT CAAACCGGTC 
CTGCAGAATG CAGACTTTAT GGGAATGCTT GGCACGTTCA CCCCGTTTGA AGAAGTCGTC 
GCACAGGGTG GC AG CATC AG AGCACTCGTT CTTCCGGGCA AGGCACGTTG CTACAGCCGT 
AGnAAA tCGA AGCGTTGGAG TCTATCGCTC GAGC ACATGA GGCGCACCAC CTTTTTTGGC 
TTAAGGCAAC CGGTGGAGGC CTCGAGGGGG GTATCGCAAG GTTTTTTGCa GGGGTAGAGT 
CCGAAGTACG CCGGCGACTT TCTGCTCAGG ATGAAGACTT GTTGCTCTTT GTCGCCGATT 
GCCGGCACCG CGTGTGCTGC GTCGCACTCG GCGCAGTGCG CAGCGCTCTT ATCAGGGACG 
AGTCGTTCCC AGAGAAGGAG TTGTTTTCTT TCGTGTGGAT CGTTGATTTT CCCCTCTTTG 
AATGGAACCC AGCGGAAAAC AAGTGGGACC CTGCTCATCA CATGTTCTCT GCTCCTCAGG 
AACAGTATCT TGAGACGCTC GAGCAAGATC CCGGTTCGGT AAAAGGTGAC CTCTATGATT 
TGGTGCTCAA CGGGTATGAG CTGGCTTCAG GCTCAATTCG TATCCACGAC ACACAGCTGC 
AAAAACGCAT CTTTAAGATA GTGGGATTAG ATCCTGAAGA AGCGGGGGAA AAGTTCGGGT 
TTCTCACAGA AGCGTTTAAA TACGGCGCGC CgcGCACGGc GGCATcGCAC ACGGGTTGGA 
CCGCCTCGTG ATGCTCATGA CAGGAAGCGA GTCAATTAGA GACGTCATTG CTTTTCCTAA 
AAATACACTC GCCGCCAGCC CCCTGGACAA TTGTCCTAGC GTGCTCGATA AGCGTCAgCT 
TGaCGAGTTA CACCTCACTG TACAC GTCTA GGGGCATCGC TACTCGCTCG TCGGCGTAAA 
ATACCTACCA GGGGGGGGAG GGGTACATGG CTTTTACTGA GAAGCAAAAG GGTACTTTGT 
GCCTAATGTG CTCGAGTTTT TGCTTTAGCG TGATGAGCGT CTTTGTGCGT CTTGCAGGGG 
ATCTCCCCTC TATTCAGAAG GCATTTACGC GTAACCTGGT CTCAACGCTC ATCTCGGGAT 
CTATGCTCTT TCGTGCGCGT ACCCGCGTCC ACGTGCAGGA TCTCCCCATG CTCTCCTTGC 
GTACCGTGTG CGGGACGCTA GCAATCGTCG CAAACTTCTA CGCAGTAGAA CGCTTAACAT 
TGGCAGACGC GTCGTTGCTT TCGAAGCTCT CTCCGTTCTT TACCATACTG TTTTCTTGCC 
TTTTCTTGGG AGAACGCATT GCGCCGTATC AAGTCGTCGC CCTCTGTGGT GCCTTTGCTG 
CAGGCACGCT CGTGGTCAAG CCGAGTCACA CCCTTTCTCA CCGTGTATTT CCCGCGTGTA 
TTGGCGCAGT AGGAGGCATG ATGACGGGAG CTGCGCACAC GTGCGTACGC TACCTCTCCA 



1620 
1680 
1740 
1800 
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CCCGTGGCGT AGAGAAGTTC TTGGTTATCT TTTTCTTTTC t TCGG ATCGC TGCTATTGCT 3360 

GCTCCCTGCA TTTATATGGC AGTACCAACC GATGAGCTCA CCGCAAGTGc TTACGCTGTG 3420 

GGCCGCAGgA GTGGCAGTAG CAGGTGCACA GTTTTTTCTC ACTGTTGCGT ATCGATACGC 3480 

GCCAAAAAAG TCGATTCCAA TTGACTATAC CCACATCTTA TTTTCGACGG GCATCGGTTT 3540 

CTTGTACTTT AAAGAGGTGC CCGACCACTG GACCGTAGCG GGCATCGGTA TCATTCTCGC 3600 

CATTGCCCTG TACGTGTTTG CGCGCGAGcg TGaACGGAAA GAACCCACCG TGCCGTCGCA 3660

CACACGCTAG AGCCGATGGC ACGCACGTAC GCGAAgCACA TGGTCTACCC CATGCTTAGA 3720 

TTTTTCTCGG TAAAAGAATG AGGCAGGTGC GCGTGACGTG CACAGGAACT TCCACCGCGT 3780 

ATTTGCCGTC GTGCGGCGCG TCACGTACAG CGACAACGTG GAGAAAATCC TTTCTCGCAA 3840 

GCAGCCgCCC TGAGGGGCGC TGGTACTGAG TAAAATCGTA GCTCACACCT ATAACACGAC 3900

GACCACCAAG CGGTACCGTA ATGCGTCCGT TAATGTCGGT AcGGTAcGTT TCATGCTCTG 3960 

CATGCTCGAA ATGGACAGTG CAAGGCAGCG CTTCGTACGA GATGTGGAAc ATATCCGCCA 4020 

CGAAGATATG CACGAAATGG GTATCTGCGC GCGGCGgGGT TTTACATGCA GCGACACACA 4080 

CCCCCACCGT AAGCGCCACA GCCAGAAAAA ATGCTGCACG CGCGCTGTAC CCACACAAGG 4140 

TAACGGAGAT TGCCGCACGC GAGGTTCTTC TCGTATACTC ACCCCTCGTA TGAGTACTTG 4200 

GACACACATC TGGTCTACTG CGTTTACCTT GCTGTTTATT ATCGATCCGA TTGGGAACAT 4260 

ACCGGTGGTA CTGTCytGCT GCGCACCGTG CCAGCTGAGC GTCATACCCG GATCATTTTT 4320 

AGAGAACTGC TTCTAGGACT GGTGCTCATG CTCTCCTTCC TTTTTTGCGG AAAAGTTTTC 4380

CTATCTTTGT TCCAGCTAGA AACGGGAGTA ATGAAAATGG CCGGAAGCGT CATTCTCTTT 4440 

CTCGTTGGCA TCAAGATGGT ATTTCCTGAT CAACACGCGC TCCCCTCCAC CACAGAAGAG 4500 

GAACCGTTTA TTGTTCCCAT CGCCACTCCC ATGATCGCAG GTCCTTCGGC GTTCACCACG 4560 

CTGGTAATTA TGGGAGAGAC GAAGGGGACA TCCCGTCTCG CCACCTGTGc tGCGCTGCTT 4620 

GTTGCGTGGA CGCTCGCGTG TCTTATTATG ATAAGCGCAC CGTGTCTATA CCGTCTTCTT 4680 

AAAGAAAAGG GAATTACCGC GCTTGAGCGA ATCACAGGTA TCTTGCTGCT CATTCTTTCC 4740 

ATCCAGATGT GTGTTGAGGG AGCCCGGGGC ATTATTGCCA CTTCCTAGCA AGAAGGAAAA 4800 

CTACCCGCTG CGTACGTGCG GGCTTAGGGG ACGACGACAA CGTTCGCGAC TCTGCCATCT 4860 

GCCAGGTATG CGCGGGCGTT GCTCTGGGTG TCAAAGGAAG AAGTGCCATC TTTGACGAAG 4920

GCATAGAGCC ACCTTCCAGG CGGGAGGGGA AGCTCTAGCT CGTAGTGGCC GGGACGCACC 4980 

TCTTCCAGAG AGTACATGAA TGGATCCCAG TTGTTAAACG TACCTGCAAG gTGGATAGTC 5040 
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TGTCCCGCTG CACCCTGGTA CACAAACCGA GTGCCCGCGG CCGTATGCTG GGTTTGATAC 
GATTCGTGAG ACGGCACATC GAGGTAAGAA ATGGaCATGC CATCGCGGTG ATCGTAGCTT 
TCGAAGCTAT TTTCAGGATC GGTAGTCCAC AACCCATCAA TCACAAGCCG G T AAC TT AAA 
CGCGAACACC cTTCAGGAAT AGGCGCGATA TGGAAAAGAA CGGAGCGTTC AGTGAGATTC 
TGGGCGCTCT CTTGACTGAG GCGCACGAAC GAGTATATCG GGCGGTACCn TTCGTGCTCA 
AACGCGATAC CCACGTGGCG CGCTGCCCCT GACGCAgTAA ACACGACGCA GCGCCCCTGA 
ATCCGAGGCG CTTCCACGCG GGAAATAGAC TCGATAAGCG CGCGGCGcTG CGTCGGATCA 
AGTCCAGCCG CGCAGAGTCC GACAGCACCA GACAAAACGA GCATGACACC AAGCGCACAT 
CCTCTCATCG AGTTTCTCGA TCCTCCCCGG CAAAGCGCAC CACCACGAAC ACACCCCCAT 
ACCACCGGTC CTGGCGAACT CGCAAAAGCG CGGCACACCC GAAACCCATA CCGCGCACAG 
CTTCGGGATA TGCATGCGCA TGCTAAAGGG AAACCTGTCC TCCTGGCAGA CTTCACTCCT 
CCACAAAAAA AACCGATACG AGGGCGGGGA GTATAACGCG CAATGCCGAG TGCACAACAC 
CTGTCAGAGT TTGCTCGCGA GCTCAAGACT CTTGGGAATG AGC C AG AC AC CCTCAAATCT 
TGGGGTACTC TGTACGATGA CCTACCACCT CCTGAATCTA CCCCCGACGG GGCACAGCCT 
GCGCCCACGC CTGAGCGGCA GTCCGCGCCT GCATCCGCGT C Ag cTTCTGG CCCTGTGTCC 
GCACATGGGC AGCGCcCCTT TGAGCCTGAC ACAGAAGCAT CGAGCGTTGC CTCGGGAGAG 
GAGGTCGTGC AGGAAGATGC GCACGCACCA CAGACTCGAA TGC ATGACTC CGCACAGGAG 
CCAGCGGCGG AGATTTCTCT CTTTTCTGAA GAGCGGACAC CGGAAACTAT GCCGACTGCT 
GCCTGGAGTG CACCACCGGA TCCTCTTTTT GAAACCGAGC ATGCTGTCCC CCCCCTACCT 
CTTGACCCGG AAGAAACACC AGTGCCCGGA GAAAAAGGTC TCCAGGAGTC CGCCGTGCAG 
GAGGAAGACG CCGGATTTAA CCAGATGCCT GCGACAGGAG GGCAAACCAG CGAGAATCAA 
CAACACTTTG AC GCATTGC T CGCCTCTCTT GATCTTGATT CGGCAAATGG CGAACGCGTG 
GTCCCCGAGA ATGCAGATGA GTTCGCCGCT CAGGTACCTG AATCCCTTCT AGAAGGGTTG 
CATCCAGAAG ACCAAGAGAC GAAACGCTCG CAAGAGGAAC CTGTATCCTA TGACTTCCCT 
GCGTTTGATC TGGACCAGGT AGCGCCTCCT AC AC C AG ACG CCCCTGATTC TTCTAACTCT 
GCTCTCACTG AGATTGAAAT CACCCCAGCG CTCTCTGAGC ACCCCACGCA GACGCAGGAA 
ACGGGTACCA CCTCGCCACA ATCGCAGACT GTGCACGCTG ATGCGTCTGC CCTAGGGCCT 
AGTGCCTCTG ATCCTAATTT TTCCCCTGGG TCTGCGGATA ACTTGGTCGC CCAATTCCCC 
ATTGAAGAAA GCGTGCAGAT ACCTCCTTTC CCCGCTGATG GCTTTGAACT TCCCGGTAAA 
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TTCCAAGAAT TTGCGAGAGA 
GCAGACCAAG CACAGACCAT 
GCCCTCCCCC TTCCTGTACG 
GACAAAGAGG GTTATGCGCT 
GCTACTCAAC TCGAGCACAT 
AAGTCAGCTG CCGCACACGA 
GTGCTTCCCC TGACGGCCAG 
CTCTCCTGGC ACTTTCTGTA 
CATGCATTAG AACTGGACCG 
TACTGGAAGA TAAAACACTG 
TATACACGTG CTGAACAAAT 
GGGAGCATTG AATATGCGCA 
ACGACAtGCG TCGGCAGGGA 
TCGGAGACGT ATATCTAGAG 
AAACATACCA ATCACTCATC 
TGCGCTATTT TATCAGAACA 
CCAATACGCG CGCTAGGATC 
AGAAACGCTA TGAATCTCAA 
TGCGCGCATT ACTTGAGCGG 
ACCTTGGAAA ATTCTTTGTC 
AAGCTGTCAA CCGTTACCCG 
TTGACGCGAT GCGCCTGCTC 
GCGAAATATT CACCCAGGCA 
CGCCGcATCG GaCTATTGGA 
ACAAAAACTA TGACTCTGCG 
CTCCTGAGGT TCAATACAAA 
CGATTCGGGC AATGAATGCA 
GATTCGGCAC CCTGTTGTGT 
AGTTACTTGA ACTGTTAGAT 




ATCTGAGAGC CCCTATTTCA 
AAGCGAAACG GAATATCAAC 
TATTGCGGTT CAAGAATACC 
CATTAGCAGC ATTGCAAACA 
TCTAAAAAAG CCGCTGCATA 
ACGCGAGAAG TCTTCCCTTC 
CTCAGCGGCC ATACTCATTT 
CAAACCCCTT CATGCGCACC 
CTACGAAGAT GCACACACTA 
GTACTTTCGT TATGCGCGTG 
TTACACCGAG TTACTCTTTG 
CATGCTCTGC AATGAGcTGC 
CTCGACCATC ATCCAAATGA 
TGGGCAGAAG AGGACCCTGC 
GCTTCCCACG GCACGCGCGA 
GATCAGCTCG CGCAGGTACT 
GCTCCTGAAG ATTTGACAGA
CCCAGTGACT CCCTTACATT 
GCCTTTAAGG CGGATCCTAT 
TACAATCACC GCAAGGACAG 
CACATGCCAC ATTCCACAGT 
GGTACGTTAC TCCTGGAGGA 
CTTACGCGcT ATCGCAGCTA 
AAACTGTACC GTGaCTATGC
TTGGAGCACT ACCAGCATGC 
ATAGGGTATA TTCAGCACAA 
GCGTACGAGC ACAATCCTCA 
AAACGTGGTG ACTACTTTGC 
GCGCAGcGTA CAAGACGCGG 
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GTCcTGATAC AACCGCCGAC 6840 

GCTTTCTCCA GCGGCTCGAC 6900 

TGTCCTCAGA GGAGACCTCG 6960 

ACGCCTCGCC AAAAGCGGTT 7020 

TTCCCAGAAA GTTTGAACGC 7080 

CCTACATCGC GAAACACACG 7140 

TCATCCTTTC GCTTGCAGTC 7200 

TGAGCTACCG CGCAGGGTAC 7260

ACTTTGAACA CGCCAAACAG 7320 

CCTTACGTGA CAAAAAACAA 7380

ATTTCCGGCA TCCCAAACAG 7440 

GCAAATACGA ACAGGCAGAA 7500 

TCCTGATATC CTCAGCGCAC 7560 

TCAATACGAG CAGGCTCGAA 7620 

TGCGTATCTT GCACGCATGA 7680

TCCTCTTAAG GCACACTTTA 7740 

ACTCAGTGGA TACCTTTTAG 7800

GCAGTCAAAG ATTGAGGATC 7860

GTCTGCGGAT GCGGCTTATT 7920 

CGCGCGGGAA CTCCTTCAGC 7980 

CAGGCGTACa CTGCGTGAAA 8040 

AAAGGGACAC GCTGCTGCCC 8100 

TATCGTAATG cGTGaCCTAC 8160 

AGATATGGAC TACTTTATCT 8220 

GCGGGCGCAG TTACTTGATA 8280 

AAAAAACAAC TACCCCGAAG 8340 

GGATAAGCAC CTTTTATATG 8400

TTCCCAGGGG TACTACGAGC 8460 

TGTCATGCTC CCCCACATAG 8520 
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AAAAGGCGGA CGCCGCGTTT GTTGATTTGT 
TATTGCACCG TTTGGCAACG ACTCATGGAG 
TGTTTGCAGA ATCCTCTCGT GCATGGGACG 
GCTCACAAGC TACCGGTCTT TCATACCTAA 
AGTTTCAGCC AGAACTGTAC GACGACATTC 
TCCAAAAGGA ACAAGAGAAC TAGCCaACGG 
TAGTCTCCCC TGAGAGGAGG CGACTGATGG 
GATACAATCC AGAGGTAGGG GATGCAGACG 
CCTACCGGAC CACTGGGArT gTCCACTCTG 
GTAGCTCTTC TGCCTAGAGG AAAGGGGAAC 
GGTACGATCG GCTACGACGG TCAACGGGCA 
AGCAGGAGTT CCCTGCAGGA ACTTCTCAGT 
GmGCCGTTTG GGAACGCTCG AmArAAGCAC 
GCTCAGGGAG CAGGTACAGC GCGCAGGACA 
CAGGGGAAAA GGCGGTTGTC CTCTAGCCGC 
TCAGGCTTAG CCTGAGGGGG TGCAGGnTTT 
AATCGTGAAA CCTTCCCCTT TTCAGCGCGT 
ATCCCCGGcA GCACCACGCG CACCGcGTgC 
CGCCTCGCGC ACTTTAACTA CCAAGCGGAC 
TGTATACGCC AAAGCACCCC CCGCGTTTCC 
ATCCCTGACA CCTTCCCGCC ACCAGCTGCG 
GAAGACGAAG GAACAGGCTG CGCGTGCGAG 
CCATAGGGAA CCCGACTGGG AGTCGAACCC 
TACTCCGAAG CGGGGACATC CGTTGTATTC 
CTCCTTCCGA CAGGAGCTGG CGGTGGATTA 
GACGTGGGAG CAGTAGGCGG AACACCAAAG 
TGCCTATCGT TGCGTTGCTG TGAAGCGTGC 
CGCGCCACCC CCGCGTTCAG CATGTCTAAC 
TCTCTATTTG CAGCGTAAGG TCCCCGATCA 
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ACATGCGCAC GTGTAATAAC 
ATTCGCGGAA AAATGCACGG 
CACTCACCCG TCACCCTGAA 
ACGTCCATCA CATGACACGC 
CTCTCCTACT TGAGCACGAA 
TGCCCGCTTG CCTGCATGAC 
GAACGTACAT GTGTGATTTG 
GGGGCATTCC CGCGGGTAtG
TGGGGTGGAC AAGACAAGTT 
GATCCAGTGA AAAAAAAGGA 
GTAGTGGACA GGGCCCGCGT 
GCGGGGGCCT TcCGAaGAAG 
TGGAGGCCGT CGCCTCCGCC 
TCGCAAAAGT TTTCGGCATT 
CTCCTCCTTT GCTGAAGATC 
CCCACTACCA ACTTTCCTGG 
GCTTTACCCT CTCTAGGnTC 
gTCTGTTCGC TCATATGAAG 
CnATTCTCCT CACGAACAAA
CCCGAATGGG TTGGCGTATA 
TAGGGAAGAG GCGCAgCAGC 
TTAGGTGCAG CGGAGCCAGG 
GGTGCTGCGT ACGCGACAGG 
GCCACTCCCG GCACACCCGG 
TGAGGATCCG CATACATGAC 
GAATCTTGAG GTAAGACACC 
GCATTCGGAT CTGCCTTGTG 
GCAACAGCTG CAGCCTTTGA 
TTGATGCGCA CGATTACCTT




CTGGGCGTAG 8580 

GCGTTAACTC 8640 

ACCAGGGTGC 8700 

CCCTACACAG 8760

GAACCGCCCA 8820 

CGAAACAGGG 8880 

TGTGGCTGGG 8940 

CGTTTGAGAA 9000 

TTGTGAAAGT 9060 

CGCTTTCGTC 9120 

GCTGAAGCAC 9180 

gCGGCTGCCT 9240 

TACAACGCCC 9300 

GCCTCCGAAC 9360 

CTGCACCCCC 9420 

CGGATAACGT 9480 

CGCGGGGCGT 9540 

GATCAAAGCC 9600 

GGCTCCCAAC 9660 

CACCGACTTC 9720 

CGCATAGGAA 9780 

CACGGCCGTC 9840 

CGGCGCACCA 9900 

AGTTCCAGCC 9960 

AGGCGCACTG 10020 

AGGAGAAGTC 10080 

TATGGAGACG 10140 

CACGTCAATC 10200 

TTTGCCGTTG 10260
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TCCAAGTTCG TCAACTCCAC AACCGTACCA AAGGGAAGCG TGCGGTGCGC 
GCGTTCATGT CAAAAATCTC CCCACTTGCG GTAGGTCTTC CGTTAAAAGA 
TAGGAAGCAT ACCCTTCCGG AACGATTACC TCGCCGGCTG CAAAAAGCAT 
CACAATACTG CAGCCACTGC AACGACACGC TTGTCCATCA TTCAACACTC 
CTTCACCCGA GCAAAACGAT GCTTCACAAG ACACCCCCGA CGCTTATCGG 
AAAAGGTTGA AATCTTTTAA GGTAGGGGCG CAGTGGGTTG CTGGAGACGA 
CGTACGACCG CTGCCGGTCA AGGGATTTTA AGTCCCTGAT GTCTACCAAT 
CAGCGTTGTG CGGCCGTGCT GCCACTGTAG CGGGTAAGTA GCCGGGAGGT 
AGTGATTCGC ACTGCCACCT TGCGCTGCTT GTAAAGCAAA GTGAGGATCC 
TTCAGACAGC TGGACCTTGC ACGCTTCCCG TTTCTCATGG AGGTCGGTAC 
GATTATCAGG AAAGAAAGGC TCTGCTCCTG CAGGCCTGCG CGGGGCGCTC 
TGTCTGCACT TCTCTGTAGG GGTCCGGCCT GCGCCGGAGC CCATTGCGCA 
GCCCTTTCAC TGCTTCGGTC AGACGTGCGT GCCCTGTGCG CAGAACAGGC 
GCCTTGGGTG AATGCGGTCT TGACCGACAC TGGAATGGCC CTCAGGTAGC
cGGAAAGGAT CTGGTGTGCG CGGTACACCA GATCTTGATG CAGAGGAGTA 
GCACAGCTCT CTATAGCGAA AGcTcAGAAC CTGCCGCTCA TCATTCATTC 
TTTGAACCGA CACTCCGTTG CCTGGACTCA GTGGGGTGGA GAAAGGGTGT 
TTCTCGTACG GATCGTTGAG GCACACGCTT TTTTAGAACG TGGTTTGTAC 
CAGGCACACT TACGTACGCA AAGACGACAT CCGAACTTCT CGCGCGCGAT 
CGGAGTATCC CTCTGGATCG TCTATTGTTA GAAACGGACA CTCCCTACCT
CCGCATCGAG GAACACACAA CAGACCCGAG TATGTCCGAC ATACCTACGC 
(2) INFORMATION FOR SEQ ID NO: 38: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2450 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 




GGCAGTATAC 10320 

CTCCGCATAG 10380 

CTGCACGTTC 10440 

CTTCCAAAGG 10500 

AATATGGACA 10560 

GACTTGAACT 10620 

TCCATCACTC 10680 

CAACATATAC 10740 

TCAATCCCTC 10800 

GCGCGCAGGG 10860 

GCTTCCTTCT 10920 

TCCTGAGACA 10980 

GCCCTACCGA 11040 

GTGCAAAGCA 11100 

TCTTTTTAAG 11160 

ACGGGACGCT 11220 

GATGCATTGT 11280 

ATCTCTTGTG 11340 

GCGCTTTATT 11400 

CGCTCCAGTA 11460

GTTGGT 11516 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38:
CACGCATGGG CGCAGACATt GGgTtCATTG GAyTTGCTGT CATGGgAGAG AATCTGgTTC 60
TCAACATgAG CGCAACGkTT TTTCCkTCGC AGTTTTCAAT CGCACCAcCA mGGTGGTCGA 120 
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CCGATTTCTT GCAGGGCGCG 
ACTTGTTTCA CTTTTGGCAC 
AGTCGATGCG GTCATTGACC 
CGGTGGCAAC TCTCATTACC 
TATTCATTTC ATTGGCACAG 
CCTCATGCCT GGAGGCTCTG 
TGCCGCCAAA GCCGACGATG 
GCTACGTGAA AATGATTCAC 
GCTACTGGTT TATGAAGCAT 
CCCGCTGGAA CACGGGCCcG 
GCACATCAGG ACACAGACGG 
AAGGGGACGG GCAGGTGGAC 
ATCACAGAGT CAGTGATGGC 
CATCGCGTTT TTGGTTCTCC 
CGCGAAGAAC TGGTGTCTGC 
GCGCAGGGTT TTGAGCTGTT 
TCCCGGaTTG CATCGCTGTG 
ATCAGTGCGG CGTTTGCTCA 
GCAGAGGrAT TAAAGCGTGC
CAGGCGTTGC CAGTTCCGGC 
CTGCTTTGCC GGCCAACCTC 
AGCGCACAGA TGCGCCGAGA 
" ATACC ATTGC AGGAACCTAC 
TATTTATATT CCCAGGTGAT 
CGGCGGATGT TGTATAACGG 
TCCCATCATC CGCTTTCCTC 
GCCCCTCGGA GGGGTAGGGT 
ATCAGCGGTC TTCCCCCCTG 
AATcTGTAtT CGTACATGGC 



CTCATGGCAA 
GTCCACGCAA 
AGATACTGCC 
AGGATACCAT 
GAGTTTCGGG 
CTCAGGCTTG 
GCACCCCGTG 
AACGGCATTG 
GCGCTGGGCA 
CTTACACTCG 
CACACCACTT 
GTGTGTTGCA 
GCGTAGTCTT 
CGTGAAAGTC 
ACTGGAAGAC 
ATCGCATACG 
GCGTGGCGGG 
GCAGCACGAT
GTGTCCAGGC 
CCTCTCTGCT 
CTTCAGGCAC 
GGAGAGTTTT 
TCAATATAGG 
CTTGACACCA 
CTATTACCCC 
CCTACCTCGT 
GCATTCGGGG 
GGCGCAGGAG 
AATATCCGTG 
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GCGAATCACC 
AATCATGCTC 
CCTTCTAGAA 
CCGGCGCATG 
GGGAGAAGAG 
GCCGTTGGTT 
CTGCGACTGG 
AGTACGGCGA 
TGAGCTATGA 
TACCTGATTG 
TTAGAGAAAA 
GCGCTCGAAG 
TCTGCGCAAA 
TCCAAAGCAG 
GCGCTGTATT 
GCAAAGCGCC 
TGTATTATTC 
CTAGAGAATT 
TGGCGCACCA 
GCGTTACCTG 
AGCGAGATTA 
TTCACACAAA 
GGATCCTCCC 
CCTCGGGTGC 
AGCCTTCCAA 
TGATTTTTCT 
CAGCAATTaC 
TTGTCCAAGA 
ACTTCCTCCC 



GGCGCCCaCT 
ATGGTCAAAG 
AAGGGGGACC 
CATGCGCTAG 
GGGGCCCTCC 
TCTCCCATTT 
GTCGGCAGTG 
CATGCAGATA 
GCACATGCAC 
AGATTACCGC 
TTCTAGATGC 
AAGGCAGCCC 
AGCAAGCGCG 
AAACGCTAAG 
GCGCGAAAAT 
GAGGATGGAC
GTTCAGGATT 
TGGTACTTGC 
TAGTGGCAGA 
GTTTGATGGG 
TTTTGGTGCG 
CTGGACAGGC 
GTCGCTTGCC 
TGCGCTAgcA 
GCTGGAGACG 
GTTCTATACG 
TtGAAAAGAA 
AGTATTGCTC 
CCATCGCGAT 



CCATTGCAGA 
CAGGCAGCGC 
TCGTTATCGA 
AGGCCGCAGg 
GTGGACCGTC 
TCTGTGCCAT 
ATGGCGCCGG 
ATCGCCGAGG 
CATACGTTTA 
GGCTATTCTG 
CGCTGGACAG 
GCTTACACTG 
CTGCAAGGCA 
TGCACAGCAG 
AGTCTCGTAT 
ACTGGaTTTT
CCTGTCCAAG 
TCCCTTTTTC 
ATCGGTACGG 
TTCACCGGTG 
CACACCTACG 
ACCGGCGGTG 
TTTCGTTCTA 
TGCGCCCGTC 
TGGGTTCGAC 
CGCTACACTC 
CAGTATTATT 
TAAAACGGTC 
ATTCAGGGCA 
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180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
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GCTTCTCcTT CGTTCGCATT AATGACTACA TATCTGAGGT TATCTTCGGT AACCGGGrTA 1920 

TCATCGTCTT CTATGACAAA TCTGCAGGGC TCACATTTTG TCTACAAGAA ATGCTGAGCG 1980 

CTTACTTAGA GCGTATGcAT GCCCAGTATC CTACTGAGGC ACTTGCTGAC TTTCTTTCGC 2040 

GTGATCCGGT GAAAGCTTTT GCGTACCTTG AGCGCTACTT TATTATGAAC ATGAAACAGA 2100 

ATAAGCGTAT GGTCCTCATC ATCGACTATT CTGAATCTCT CGTTCCCTCA GAAGATATTG 2160 

CAAACTTAAG CGAAACAGAT CGCTATTGCT TCGTCACCCT CAATCGCTGG GCAAATGATC 2220 

CGGTGTTCAC AAACGAAGAC ATATCCGTTG TGATGCTCAC GGAGAATATC ACTGACATCA 2280

ACAGTCGGTT CACCGCTTCT CCTTCCACCG TTAAGATTCA CATACCCCTG CCAAATGAAG 2340 

AAACACGGAT ACGCTTTCTT GAATATCTCA AAACCCAGGA GGAGATTTTA GTACTTGAAC 2400

GTGGGTTGAA TACGGAGAAA ATTGGCAAAC TCACTTCCgG TTTGAATTTA 2450 
(2) INFORMATION FOR SEQ ID NO: 39: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6426 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39: 

AGCCTTTTGC TGCCGTCAGA GGACGTGCGA ACTTCATAGG TGCCCGTTTG GAAATTACGT 60 

ACCACACGTG AACGACCGGT GGTGCAGCTG TAAACTGCAG CACCACGCTG TACGTACTGA 120 

GGGATTTCGT ACGTCACACC GTCATCAGCG GTAAAATACG CnCGTGCGCT GCAAATGCCA 180

GCGCGTCATC ACGCCAGTAC CGAAAGCGGT AAAAATTCGC CTCGGTGAAG AAAAAGGCAT 240 

TCAGTTCTCC AGGCGCCTGG CTAGAATACG CACGCAACGC CTCAGCAGAC CCTTCCTCAA 300

TCCAAAAACC AAAAAGAGGA TCGTCTGTTC TCACGGGAAT CTGCTCGGAC GAGACGGACC 360

CGGAACGGGT ACCATACGAT ACCCGCTGAA AAAAAGAGCA GAACAGCGCG TCCCCCCAGC 420 

GACACAGAAC GAGCGGyTGC ACACGCACAC GCCCCTCAAA CGACAAGACA AGATCCACCC 480 

GTTCTGCCTT TTGCGCCGCA GGGGGCGGCA GCACATCAGC ACGAGCATCA GACAAAGATG 540 

AAAGGGAAAA GACAGCCACG CGTTCATACA CGTACGCATA ATACGGCTTG TGCACAAGCA 600 

TGAGCGAAAA ACGCATCGCA TCAGCAGTAG GAGCGATAgC AACGATAcGC GTGGCGTTCT 660 

CCCACACCCC TGCCAAACGC GCCAGAGCAG GCTCAGCTGC CAACCCCACC TGAGTACTGA 720 

GCACGCACAG ACGCACCCAC GCGCTCCACC CTTTTCCTGT CATACCGCGT AACCGGTCAC 780 
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TCAAGCGGCG CAAAATACCC TGTCTCGTCC CGCCCCGAAC GGCTCAAGAC GCGCACTTCT 
GCCTCCACCG GCATATATGC ACGACCAAAA TCCGTCGGCA ACCGCGATCG ATACGACAAT 
GCATAGGTGc CCCGctGCGC CTGAGCGCAC GTGCGCCACA AGAGCCCCTA ACCCTTCTTG 
CGCATACACT GAATACACCG CACCCCTTGT CTTTTGAACC AGATATCCAA GCTCCACGGG 
CAATACCGTA CGCTGAAGCT GTATTACGTA AAAGCTCGTT TCATTATTCG TCAAATATGC 
AGCCAAGTCC GTCACCCCAT ATTGTGCAAA ATCCTGTGTC TGCAATTCAC CAAGAGACAA 
GAATACCACT GCGCGCTTGT AATCTGCGTT CACCAGCGTG CCTGCCGCAA GACGCAGACC 
CAGATCAAAA CGCCACTGCG AAGAACTCTT TGCTCTTAGG CGAAGcGGTT GCGcCTGCAG 
tGCGCCGCAG TGAAGGTGCC TTCCAGCACT GGTGACTGTG C CGC AGAAAC TACCGACAGC 
GTCCCCTGTT CCCCCACCGC TTGTGCAAGC TCACCAATCA CCCGTGTAAC CAAGCGCATC 
TCCTGCTCCG TCGCAGGAGA GCGGTCCACC AGTATACTGA GCGAACAcGT ATCGTTCAAa 
TACGCTGCCC CcTGCAGACG CATCTCACTT ACCGGCCGGT GTTCTTCGGT AAGAAAAAAG 
TTGGAAACGT CCAATCCCAC TACCGGCGTC CCCTCACGCG TGTGTACCGA CACATTCACA 
GTAACGGAGG GAAACCGGTC GGCGTGCACC CGCTCAAAAT GCACAAACAG ACCGCCGGCA 
AGCTCAGAGA TACGCGAAAC AATCTCAATC CTTTCGTTTT TATAATCGGC AAGGAGCACA 
TTGCCATTTG CATCCGGAAC TGCCGCCGTT AAGCGAATAG GCGCATTTCC CAAACGGGCA 
ATCGTGTGCA AGGACGCGAG TCCTACATCC ACCACCATGA CCTCATTCGG GAGAGACACC 
AACAAGCGTC CATTCCACGC CCGCACAGAT TCAACGTGCT TGAGCGTCCC TTCCGCAACG 
AGGGTACGCA CATAATTTCC TGC CGTATC A AACACGTAAA TGGCGCCCTT CAGAGCATCT 
GCCACGTACA CTAGCTCATC GAGAATGGCA ATGCCGCCCG GAGCAGAAAA CCCAAAGAAG 
CGCGCAGACT TCTGCCCAAA ATGGAAGAGA GGAGCACCAT C AGG TGC AAA CACCGCCACA 
CGCGCATTCC CAAAATCTGT C AC GT AAATG TTATCGTAGC GATCAGTGGC CAAAAACTGG 
GGGCCGATGA GTTGTCCGAC GCCTCTCCCC TTTCCCCCAA ATGACTTGAG GAACCTGCCT 
TCCTTC GTAA GACGACAAAT GCGATCAGAG GCAAATTCAG AAACGAGCAG ATCGCCTGAG 
CGCGTCTGAA TAACATCAAA AGGACGGTCA AAACCCTCAA CGGGCCCACG CGTACGcgCA 
ATAACACGTC CGTTCACGTC AAAGCGAAgc AGCTCGTTAG AACCGTATGC GCTCATCCAA 
AACGTACCGT cAGcTAACGC ACACAAAGAT AGTGGTCTGC GGAAAAGAAC CGTTCCCCGG 
CGTACAGCAT GAAACGATTC ACTTTCGCTA AAGTGCAGCG CGTCTGCTGA ATCAGGCGCA 
AAGTCACGCC GCTGCTGAAC CACTTCTATC TTGTTCCGAA GCAACGCGCC GCCGTAGCCT 
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AGATCCCGCG CCGCGCCCCA CTGGTGcAGC GctGCGCCTT CAATCCCACT GCGGTAGTAC 

GCATTCCCCA ACCACTCAAG AATGAGCGGA TTACGGGGAG CAGCAGAAAG CGCACGCTCA 
AACAGCTGGA TAGCATCATT GAACGCACCC CGGTAATAGG CCAAAACTCC GCGGCGAAAC 
TCCCCTGCTG CAAGTGCTGT ATCACGCACA ACCGGTGGCG CATGCTCCTG CGCCCCCACT 
GCAAAAAGCA ACAGCAGCGC GCCTGCGCAC CCCACAGATC GTCTACTCAA ACACCAACCC 
CCTCTCACTG CCTTTCAGCG CAGTCTCTTC TTTCTCCAGA AAGCTCACAA AAGGTGCACA 
AACAAAAAGC AGAGAGAAAA AAGGAGCACG CAGGCCAAGA CAAAGAGACT ACCTCGAACA 
GACGCACACC ACGCCCTATC CTCAGTACGA GCAACAAGCC TGGAACGCAA AATCCGGCAA 
CGGCAACACA GGAGGCATTG AAACCGGCTG CGCATACACA AGCGTAATCG CAATGTCACC 
ACGCATTACA ACCGCCCAAT CGCTGTCCTG CACCGCGCCT GTATGCGCCG CTCCACCCTG 
GTACTCAAAC ACGTGAGCAA GGCCCGCCTG GCGCACCGCC CCACGTTCCT CATATATCCG 
ACGcGCAACG CGCGCtAACA AGCACGCGCA TACGCCTGTG CCGATCGCGC GTTTATGTTC
ACCAACACGT CCTCGGAGGA AAGTGCAGCC AACGCCCGCA CGTGACGCGC CACAAAGCGG 
GCAAGCGCCT GGAAACGGCA GCCTGTTCCA AAATCAGAAA CGGGCAACAG GCTCGCTGCA 
ATCCCCGCAA TCCCATACAT CACCGCCTGC CGTGCTGCGG CAACCGTTCC CGAGAACACA 
ATATCAGTCC CCAGATTCTC CCCTTCGTTA ATTCCTGACA CCACCACATC CGGCGGTGTA 
CCCACGCACA CCTGGCGTAA CGCGCGATTC ACACAATCCA CCGGCGTCCC TGAGCACGAC 
CAAATACCTG GCTCCACTTC CTTTACGGTC ACCGGCTCGA GCGTAGTAAT CCCATGCGAA 
ACTGCAGAAC GATCTCTGTC CGGCGCAACT ACCGTCACCT CATACCCCTC AGGCGCTGtT
TCAGCGCCGC aTGCAGCGCG CGAATGCCTG CTGCCTGATA CCCATCATCG TTTGTCAGTA 
GTATCCTCAT AACACCCGGG CCCCTTCAGA GCACTGTACC TCATACGCCG CTGCTTTGAA 
ACCGAAGATG CGCTCGTACT CGTCGAGCCT TTCTAGGTAC GGCTCAAAGT CTTGATCGCG 
CAAAATCGCA TAGGTGCACC CACCAAAGCC CCGACCCGTG AGGCGCGAGC AGACCACATC
CGGCGCATCA GGATCTACAA ACTCAAGCGC ACGCTTCACC AACCAATCGA GTTCTGGACA 
AGAAATTTCA AAGCGGTCCC GCAGGCGCTC ATGAGAGCGG TTCACTACTC TTGAGAACGC 
AGCAAAATCC CGCTTACGCA GGGCTTCAAT CGCCTCATCA ACGCCCAGCG ACTCGCGCAC 
CAAACTGATC ACTCGCCTCC GTATTCCCTC AGGCACATCT ATTTCCTCCA ACGCTGCTGC 
CaTGAGCTTA GACATAGCGC GAGGCATATC GGGATTGCGC TTCACCAATT CATAAGCATC 
CACGCAACGC TTCAAACGCG CGGTGAACTC CTCACGCGCG ATGAAACGGG GAACACGCGA 
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GTCAGTAAGC ACAATACGCT TCCCCTCCGA GGGAAATTGA CACAGTTCCG CCTGCTTCTT 4320 

GCGGTGATCA GTGCGCACGC AGCTACCCTG CTTTGCAAAC AACACGCACA GAATATCCGC 4380

GCGATGTGCG TGGGTCTTGA GATAGCGCTC ATTTGCGTGT TCCACGATCG AAACAACACT 4440 

TTCCTTTGGC AGCGTAGcGG CAAACAACCT TCCAAGCACA AGGGCCATGG CAACCTTCAG 4500 

CGCATTGGGA GTACCCAGCC CCGCATCAGG AGGAATCTGA GAAAGGATAG TGCAGTTCAA 4560 

CCCCGTCAGG TGATACCCAC CATCCATGAA GGAGAGAATG ACCGCCTTTA CCGAATTAGC 4620 

CCAGCGATCC TCCTTACGAT AGCGTAAATT AGCGGTGGAA ATCTTCCTCC GCTCCCCAAG 4680 

CGTTAAGGAG AAAAGGCGAA AGGTGCTATC CTTTCGGCGC GAGACACACA GCGTAAGGGT 4740 

TTGATCGATA GCCATCGACA GGGTGTTGCC CtGAGCAAAC CACAGATACT CCCCCAACAG 4800

GTGAAAACGA CCCGGAACGA CTGCAATCGC CTCAGGCTCG TCGCCGTACT CCTCTGTGTG 4860 

GCAGGACTCT AyCcCGTGCA TGCGCAGCAT CATAGcCAGT GTATTGAAAT AATACAACAA 4920 

AAATGCTTTT CTGGCAGGGG AAAGTTATGC TTTGCACAGC GCCTCTTGTT TCAAGCGCCG 4980 

CCTCGGCGGT GCTCTTGGCA TTTGCGATTC CCAACGAGTT TTGGCTCGCC GGTTCCTCCG 5040

TGCTAGGGTT GGGGGCGCTT GTTCCCTTGT ACGTTGGATT CCTCCTCTCC CCTGCAAAAA 5100 

AACACGTTGC CTGTTCTTAT GGGCTGTTCG TCGCACTCGT GCACGCGTGT TCTAGCTTTT 5160 

GGCTCAAAAA CTTTCAGGGC TTCGCGCTCT TCACCCTCGG CGCATCAACT GTCGGTTACT 5220 

TCTTCTATGC GCTTCCTTTC GGCGTAgcGT tCGCATGCAT CCTGCGCAAg CaGGCgCCCG 5280 

CGCGTGCCTG CGCTTTTGCG CTCGTGTGGA CCCTCTGGGA ATGGGTAAAG TCAACCGGTA 5340

TACTCGCCTA CCCGTGGGGT ACGGTCCCTA TGACCGCGCA CAGCCTCTCG CACCTCATAC 5400 

AGATAGCTGA TATCACCGGC GTCTGGGGGC TTTCCTTCCT CATCCCGCTC GCAAACGCGT 5460

GCGTTGCAGA AAGTCTCCAC TTCTTCATAA AAAAGAGAGA CAGCGTCCCT GTGTTCCGTC 5520 

TCTGGCTCCT CACCGGCTGC TTGTACTGCC TGTGCAGTCT CTACGGTGCC TACCGCATCG 5580 

CCACCCTTGG GGCTCCACGT ACCACGCTCG CGTTGGCAAT CGTACAGCAA AATGCAGATC 5640 

CGTGGGATAC AACTTCCTTC GAAAAAAACC TCACCACCGC TATACATCTG ACTGAGACAG 5700 

CCCTTCGTAC GCAAACAGCT CCCCCCCTGC CGACTACTCC CTACAGAAAA GAAAAAACAC 5760

TCACACACGC TTCTGCGCgC GCACCTGTCG ACATGGTGGT TTGGAGCGAG TCTAGTCTGC 5820 

GCTATCCGTA CGAACAGTAC CGTCACGTGT ATAACGCATT GCCAGCGGcA CGACCTTTCT 5880 

CGGCGTTCTT GCGCAcGCTC GGCGCGCCCC TTCTGGTGGG AACCCCCTTG AGACTGTCTG 5940

GTAACTCCAC TAAAGGTGGA TACGCCAATG CAGTGGCCTT GcTCCGCCCA GACGGGCACG 6000 
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TGGCGCAGGT ATATGGCAAA ATGCAGATGG TGCCATTTGC AGAATTCATT CCCTGGGGAC 6060

ACATGACATC TGTACAAAGA CTGGCGCAGA TGCTCGCCGG CTTTTCCGAA AGCTGGACGC 6120

CAGGGCCAGG GCCGCGCTTG TTTCATGTGC CGTGCGCCGC AGAGGCAGCG TGCGCTTCGC 6180

AACTCCCATC TGTTACGAAG ATGCCTTTCC TTCCCTCTGC GCCGCTTTGC ACACACAGGG 6240

GAGTGAGCTC CTTATTAATC TTACGAACGA CTCTTGGTCA AAAACTGCCA GCGCAGAGTG 6300

GCAGCACTAT GTTGTCTCTC TTTTTCGGGG CATAGAGCTG CGTACCAACC TCGTGCGCTC 6360

TACAAAnTCT GGCTATACCG TCGTCATCGG nCCAGAGGGA AAAAnGCGCG CCGGTTTTCC 6420

GTTGTT 6426



(2) INFORMATION FOR SEQ ID NO: 40: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2190 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40: 

TGTGCGCAAC AGACAAACAC GTCCGGCAGG ACGTACTTCC ACAAGnAAGC GTTCCGTCAC 60 

GCCACAGGGG TAGGCACCAG GACGCGCCAC GTAGATTGCA CTCACTCCTT GCTTTTCAGA 120 

GGAAGGAGGT GATCCTGCAT CCTGTTTCTT TGTCTCACGT GCTGTGTCCG ACGCATGTAT 180

GCAGTGGAAC GAAAACTCAC GTTAAGGGAT TTTGGTCATG AGATTATCAA AAAGGATCTT 240 

CACCTAGATC CTTTTAAATT AAAAATGAAG TTTTAAATCA ATCTAAAGtA TrTaTGrGTa 300 

AACTTGGTCT GACAGTTACC AATGCTTAAT CAGTGAGGCA CCTATCTCAG CGATCTGTCT 360

ATTTCGTTCA TCCATAGTTG CCTGACTCCC CGTCGTGTAG ATAACTACGA TACGGGAGGG 420 

CTTACCATCT GGCCCCAGTG CTGCAATGAT ACCGCGAGAC CCACGCTCAC CGGCTCCAGA 480 

TTTATCAGCA ATAAACCAGC CAGCCGGAAG GcCGAGCGCA GAAGTGGTCC TGCAACTTTA 540 

TCCGCCTCCA TCCAGTCTAT TAATTGTTGC CGGGAAGCTA GAGTAAGTAG TTCGCCAGTT 600 

AATAGTTTGC GCAACGTTGT TGCCATTGCT ACAGGCATCG TGGTGTCACG CTCGTCGTTT 660 

GGTATGGCTT CATTCAGCTC CGGTTCCCAA CGATCAAGGC GAGTTACATG ATCCCCCATG 720 

TTGTGCAAAA AAGCGGTTAG CTCcTTCGGT CCTCCGATCG TTGTCAGAAG TAAGTTGGCC 780 

GCAGTGTTAT CACTCATGGT TATGGCAGCA CTGCATAATT CTCTTACTGT CATGCCATCC 840 

GTAAGATTCG CACTTCTAAG GCGTTCCAGA CTTCCCTTTC CCAAACTTTC TCTCAGGTTG 900 






WO 98/59034 PCT/B158/13041 


GCCTCAGTGG GCTCCAATCT GGGGCAGAAA AACCAGTACG AATGnATCCG ACACAAACCA 960 

GTCTAACGAG CCGGATGATG CGTCACAAAG GATGGAGCAC AAAAGGGAAA CGTTGGAGTG 1020 

ACAGAACAGC ATGGCAAAAA CGCGCAGGCG TTGGGTCGGA GCCAGAGAAC TGCGGTCGCA 1080 

TTAGCnCCTA ATTTTGCAGA ACTCTGTGGC AGCCAGTACG GGAGATAGGA AAGTTGCTCA 1140 

ATTCGCAAAC AGCACTTTTT TCTGACATTC CCAGCCTGTG GCCCATAAAG GGAGGCGTAG 1200

TCACATTTCC ATGGCATTTG GCAAGAACCG ACATCCATTT ACAGGGCAGT GGTATGTACA 1260 

CAAGGGTATT GATCTATCCA CTCACCGTTC AGGGGATCCT ATCGTTGCCA CTGCAGACGG 1320 

ACATGTGGTG ACGGTAGAAT ACGATTCGGG TTGGGGAAAC TACGTTATTA TCAAGCACAA 1380 

ACATGGGTTT TATACCCgcT ACGCGCACAT GCAATCCTAC ACCGTCACCC GTGGGCAGCA 1440 

CATCCGACAA GGACAAATCA TCGGTTATAT CGGCGCCACG GGTGTAGCGA CTGGTCCACA 1500 

TCTGCACTAT GAAATACATA TCGGCTCTGA CGTTGTCGAT CCTGGTAAAT ACCTCAACGT 1560 

CAAAACTGCA GGGGCAGGAT AGTGTCTCAA CAGGATGGAA TACATGGCAA AGATTGAGCG 1620 

TCGCTCCATG AACACGCTTA TTGGTGCAGG CTCCCGTATC AGCGGGAACG TTGTTGTCCC 1680 

CGGTTCAGTT CGCATTGAAG GGGATGTCGA TGGGGACGTT ATCACTACAG GGCACGTGGT 1740

AATCGGGAAG CGngcGcGTG TCCGCGGCGT CATACGGGTA GGGAGCATCA TCGTAGGAGG 1800 

AATGGTTGAA GGAGATATCG TTGCGTCAGA GGCGGTGCAG GTGCTCCCTT CTGGAGTTAT 1860 

TCTGGGCGCA TGCTTACCCG AAAAATTGTG GTGGACGAGC AAGCTTTTTT GGATGGTTTT 1920 

TGCTATGCAG TGGCAGATCA AGAGGGATTC AACAAAGTGC TCAAGGCCTA TCTCGGTCGT 1980 

AAAAGTATTC ATACGTCTGC GTTTgGATAC AACAAGTACA GCAAGTCAGG ATAAAGCGGa 2040

TGGGATATCG CGTAGGAAAT TCTGACTCTA CGTCTTTACT GTCCGCATTC GCTCCTCCTG 2100 

AGAGAGCCAA AAAAAAGTCA AAAGAAAAAC GGCCCCTGCA GGCTGCGCGC TTTCTCTCCC 2160 

TCCTATATCC TAAGACGGAn CCGCACTCTG 2190 
(2) INFORMATION FOR SEQ ID NO: 41: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 6570 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41:
CTCCGTATAG AGGGCCTGAG TATAGGCACG CCCCACAGGG ATTGTCAACG TCTTATGCAG 60 
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